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Preface 


Serge Lang was an iconic figure in mathematics, both for his own important work 
and, perhaps even more crucially, for the indelible impact he left on the field, 
and on his students and colleagues. It would be difficult to find a mathematician 
who came of age in the past forty years, who had not been exposed to Serge’s 
articles, monographs, and textbooks. Serge’s writing shaped the mathematical 
perspectives of all who came in contact with them. Many were challenged by 
the glimpses of open problems and conjectures that Serge interweaved with his 
expositions of established subjects. Serge’s exposition invariably transcended our 
discipline’s preference for brevity and perfection, which often obscures the intuition 
underlying the subject. Serge was never one to conform. 

One of Serge’s uplifting qualities was his openness to new areas of mathematics 
and his concurrent willingness, even eagerness, to learn novel concepts and 
techniques. He was constantly reinventing himself, while sharing his accumulated 
wisdom with students and young mathematicians. Over the course of his career, 
he traversed a tremendous amount of mathematical ground. As he moved from 
subject to subject, he found analogies that led to important questions in such areas 
as number theory, arithmetic geometry, and the theory of negatively curved spaces. 
Lang’s conjectures will keep many mathematicians occupied far into the future. 

This memorial volume contains articles in a variety of areas of mathematics, 
attempting to represent Serge’s breadth of interest and impact. We are happy to 
publish here (for the first time) Serge’s final paper, The heat kernel, theta inversion, 
and zetas on Y\G/K, written jointly with one of us (J. Jorgenson). Except for 
that one article, which was left in the form it assumed just before Serge’s passing, 
every other entry here was thoroughly refereed. We thank all the authors for their 
contributions to the volume and for their willingness to put up and comply with our 
demands for revision. Thanks also to the anonymous referees for their excellent and 
timely work. 

We, the editors, are pleased to be a part of this production, especially since we 
were all fortunate enough to know Serge personally. We thank Stacey Croomes, the 
math administrator at Caltech, for her invaluable help in organizing the receipt of 
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the articles, the refereeing process, and the revisions. We are grateful to Ann Kostant 
and Elizabeth Loew of Springer for their enthusiasm and helpful advice during the 
many months of editorial preparation. It took a village to produce this volume. 
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Introduction 


John Tate 


This introduction is meant as a brief account of Serge Lang’s life and his enormously 
varied contributions to mathematics. Much more about this remarkable man can be 
found in two articles in the Notices of the AMS by Jay Jorgenson and Steven G. 
Krantz. The first of these, “Serge Lang, 1927—2005” (May 2006) contains a fuller 
account of Lang’s life than we can give here, and includes memories of Serge 
by twenty-two of his friends, students, colleagues, and even some whom Serge 
might have seen as adversaries. Read together, these short pieces give a vivid image 
of Lang. The second article, “Mathematical Contributions of Serge Lang” (April 
2007), contains an overview of his research, followed by discussions of its different 
aspects by seven colleagues in the various fields. These articles, and conversations 
with Lang’s friends Dick Gross, Dinakar Ramakrishnan, and Ken Ribet, have been 
of great help to me in writing this introduction. 

Lang spent his childhood in Saint-Germain-en-Laye, a western suburb of Paris 
famous for its chateau and long terrace with a view over the valley of the Seine and 
Paris in the distance. Lang’s teen years were spent in quite different surroundings. 
After emigrating with his family to Los Angeles, he attended Beverly Hills High 
and Caltech, graduating in 1946 with a BA in physics. 

He then did a year and a half of military service with the U.S. Army in Europe. 
This was of great help to him in his future career, for he served in a clerical position 
in which he learned to type at incredible speed. 

Next, Serge enrolled in the graduate philosophy program at Princeton. For- 
tunately for mathematics he was disappointed by the quality of the philosophy 
seminars, and managed to switch to the math graduate program the following year. 
I don’t know why he chose mathematics, but the Princeton math program was an 
outstanding one and student morale was very high. 
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Though he was not particularly well prepared, Serge plunged right in. For 
example, knowing little number theory, he attended Emil Artin’s seminar on class 
field theory during his first year and was fascinated. Math, especially algebra and 
number theory, were the subjects for him! He soon became one of Artin’s Ph.D. 
students along with me and a few others that Artin had brought with him from 
Indiana. We all felt very fortunate to have Artin as our advisor. In the foreword to 
his collected works, Lang writes, “J take this opportunity to express once more my 
appreciation for having been Artin’s student. I could not have had a better start in 
my mathematical life.” 

Lang got his Ph.D in 1951 with a thesis on quasi-algebraic closure in which he 
proved that a field complete in a discrete valuation with algebraically closed residue 
field is quasi-algebraically closed, and that the same is true for several kinds of dense 
subfields of such a field. Another result in his thesis is a key ingredient in the proof 
of the Ax—Kochen theorem,! which can be viewed as a corrected version of Artin’s 
conjecture that p-adic fields have property C2. For each degree d, this is true for all 
but a finite set of primes p, but not necessarily for all, as Artin had guessed. 

Lang stayed in Princeton for two more years, with postdoc positions at the 
university and at the Institute for Advanced Study. Then, after two years of an 
instructorship at the University of Chicago, where he interacted with Weil and 
his circle, he accepted a permanent position at Columbia University. He stayed 
there for the next fifteen years except for a Fulbright Fellowship year in Paris 
during 1957-58. In 1970, Lang resigned his position at Columbia in protest of the 
university’s handling of student unrest during 1968-69. After visiting professorships 
at Princeton and Harvard, he accepted a permanent position at Yale. He retired from 
Yale in 2005, a few months before his death. That is a bare-bones account of Lang’s 
life and the way he got into mathematics. 

Serge Lang contributed to mathematics in so many ways that it’s hard to know 
where to begin. Let’s start with some remarks on his research, with no attempt to 
cover it completely. He published his Collected Papers in five volumes with Springer 
in 2000. They contain all of his research papers through 1999, together with reprints 
of a few of his Springer Lecture Notes and two of his books that were out of print. 
There are also some interesting accounts of some special topics on which Lang held 
strong views, especially in Volume IV. 

In a brief foreword in Volume 1, Lang gives his own classification of his work 
into periods as follows: 


1. 1951-1954 Thesis on quasi-algebraic closure and related matters. 
2. 1954-1962 Algebraic geometry and abelian (or group) varieties; geo- 
metric class field theory.” 


' As Deligne pointed out, I misstated the Ax—Kochen theorem in my Notices article on Lang’s early 
work, by interchanging “prime p” and “degree d.” I hope this “senior moment” misled no one. 


This work was the beginning of higher-dimensional class field theory and earned Lang a Cole 
Prize in 1959. 
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3. 1963-1975 Transcendental numbers and Diophantine approximation on 
algebraic groups. 

4. 1970 First paper on analytic number theory—jump to Jorgenson— 
Lang. 

5. 1975 SL2(R)—jump to Jorgenson—Lang. 

6. 1972-1977 Lang-—Trotter Frobenius distributions. 

7. 1973-1981 Modular curves, Kubert—Lang modular units. 

8. 1974, 1982-1991 Diophantine geometry, complex hyperbolic spaces, and Nevan- 


linna theory. 

9. 1985, 1988 Riemann—-Roch and Arakelov theory. 

10. 1992—2000+ Jorgenson—Lang (analytic number theory and connections 
with spectral analysis, heat kernel, differential geometry, Lie 
groups, and symmetric spaces). 


The arrangement of the 2007 Notices article fits quite well with Lang’s scheme. 
Here is a list of the authors in the order of their appearance, followed by the periods 
of Lang’s work they discuss. Tate 1,2; Buium 1,2,3; Waldschmidt 3; Rohrlich 6,7; 
Vojta 8,9; Jorgenson 10; and Kim, who wrote on the theme of the fundamental group 
in Lang’s work, rather than on a specific period. 

Lang wrote over 70 research papers and proved many important theorems, but of 
at least equal significance were his conjectures, his points of view, and his way of 
looking at things. Waldschmidt expresses this well (loc. cit.), writing about Lang’s 
work on transcendental numbers: 


With his outstanding insight and his remarkable pedagogical gifts, Lang comes into the 
picture and contributes to the subject in at least two very different ways: on the one hand, 
he simplifies the arguments (sometimes excessively) and produces the first very clear proofs 
which can be taught easily; on the other hand, he introduces new tools, like group varieties, 
which put the topic closer to the interests of many a mathematician. 


Waldschmidt concludes his article as follows: 


Among the contributions of Lang to transcendental number theory (also to Diophantine 
geometry), the least are not his many conjectures which shed a new light on the subject. 
On the contrary, he had a way of considering what the situation should be, which was 
impressive. Indeed, he succeeded in getting rid of the limits from the existing results and 
methods. He made very few errors in his predictions, especially if we compare them with the 
large number of conjectures he proposed. His description of the subject will be a guideline 
for a very long time. 


As Vojta points out, the title of Lang’s magnum opus, Fundamentals of Dio- 
phantine Geometry suggests that Serge’s outlook on number theory was decidedly 
geometric. Mazur puts this beautifully in concluding his memory of Serge article in 
the Notices: 


Over the decades of mathematics, Lang was led, more specifically, by an over-arching 
vision, which he pursued through the agency of various fields of mathematics. The vision, 
baldly put, is that geometry is an extraordinarily striking dictator of qualitative dipohantine 
behavior. The still open Conjecture of Lang in higher dimensions continues to serve as a 
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guiding principle to the way in which the grand subjects of geometry and number theory 
meet, just as Serge himself served as an inspiror of generations of mathematicians, and a 
spokesman for intellectual honesty. 


The conjecture of Lang to which Mazur refers is easy to state. A projective 
algebraic variety V defined over a number field F C C is Mordellic if and only 
if the corresponding complex space V (C) is hyperbolic. Here Mordellic means that 
for each finite extension field FE of F,, the set V(E) of points of V with coordinates 
in F is finite. Hyperbolic meant for Lang, when he made the conjecture in 1974, 
that the Kobayashi semidistance on V(C) is actually a distance, but we now know, 
thanks to Brody (1978), that this property is equivalent to there being no nonconstant 
holomorphic map C — V(C). A Riemann surface is hyperbolic if and only if its 
genus is > 2, so that for curves, Lang’s conjecture is equivalent to the famous 
Mordell conjecture proved by Faltings in 1983. In higher dimensions the conjecture 
is still open, though it has been proved for closed subvarieties of abelian varieties. 

In the 1980s Lang thought deeply about the Mordellic—hyperbolic relationship 
and introduced plausible variants of the above conjecture which have turned out 
to have very interesting unexpected implications, such as the existence of a bound 
B(g, F) depending only on g and F for the number of rational points on a curve of 
genus g > 2 defined over a number field F’. 

Serge led a regular life. During the winter holidays he visited his sister in Los 
Angeles. He spent the early summer in Europe and July—August in Berkeley, where 
he had an apartment. In Europe he spent a month in one place, Paris in the early 
years, Bonn later in his life, but also visited regularly other mathematical centers, 
Zurich, Berlin, Moscow.... In Berkeley he interacted with the large community of 
resident and visiting mathematicians. 

Lang was an effective communicator, an excellent source of mathematical news. 
Dick Gross likens his gathering and distributing information to the cross-pollination 
of a bumblebee. Serge kept in touch with friends in many places, not only in person, 
by traveling, but by phone. If he had a question or thought for anyone anywhere 
in the world, he just picked up the phone. When Yale was considering an offer to 
Serge, I remember warning the department that if he accepted, its phone bill would 
at least double, but that in fact, the phoning he would be doing was just one more 
reason for making the offer. 

A more important reason for Yale’s doing so is that Lang was an excellent and 
caring teacher. This was recognized by his being awarded the Dylon Hixon Prize 
for Teaching Excellence in the Natural Sciences at Yale. We were reminded of the 
esteem in which his former students held him by the testimony of so many of them 
at the memorial meeting for Lang at Yale in February 2006. One of them, Anthony 
Petrello, announced the establishment of a Yale fund for an annual prize in Lang’s 
honor which he was launching with a large seed contribution, and the promise of 
matching funds. 

Another memorial to Lang is the Serge Lang Undergraduate Lecture Series at 
Berkeley. There, when students returned to classes at the end of August, Serge 
often gave talks to the Math Undergraduate Student Association (MUSA). His 
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talks over the years were incorporated into his 1999 volume “Math talks for 
Undergraduates.” These talks became formalized, and each year from 1999 to 2005 
Serge gave a MUSA lecture at 4pm on the first day of classes. The lecture on 
August 25, 2005, “Weierstrass—Dirac Families,” shortly before his death, was one 
of Serge Lang’s last mathematics talks. In response to MUSA’s request to somehow 
continue this tradition, the Department of Mathematics inaugurated the Serge Lang 
Undergraduate Lecture Series, each fall inviting someone to give a lecture for 
undergraduates. Ken Ribet made this happen and Anthony Petrello contributed a 
major part of the initial funding. 

Serge was known not only for his support of his students, but also of his younger 
colleagues at the start of their careers. John Coates thought one of Serge’s most 
remarkable qualities was his unstinting support of young mathematicians. Barry 
Mazur, after recounting his first encounter with Serge, writes: 


And Serge did this sort of thing through the decades, with many of the young; he would 
proffer to them gracious, yet demanding, invitations to engage as a genuine colleague—not 
teacher to student, but mathematician to mathematician; he did all this naturally, and with 
extraordinary generosity and success. 


Lang was awarded the 1999 AMS Leroy P. Steele Prize for Mathematical Expo- 
sition “for his many mathematics books.” The amount of mathematical knowledge 
that has been made accessible to students of all ages all over the world by Lang’s 
more than 40 books is amazing to contemplate. Their range both in subject and in 
level is astonishingly broad. Most were new and modern for their time with Lang’s 
insistence on functoriality and axiomatization. He was almost unique in the way 
he regularly learned new topics throughout his life, topics often not close to his 
main interests (algebra and number theory), and wrote textbooks on them (such 
as “Differentiable Manifolds” and SL2(R)), thereby influencing new generations of 
students pursuing those fields. He brought excitement to his books that challenged 
readers to rise above themselves by tackling them. 

Lang’s Algebra is a classic, still the best reference book in algebra in print today. 
The first edition in 1965 has been kept up-to-date with new editions and revisions. 
The latest is the corrected fourth printing of the revised third edition (2004). For 
Lang, the important thing in a book is its timeliness, and its global aspects, such as 
arrangement of topics, and degree of abstraction. He did not worry much about an 
occasional error in a proof, and was widely criticized for this. Given the short time 
he spent writing a book, there are relatively few of these oversights, and when he 
became aware of one he was highly attentive to its correction in the next edition 
or the next printing. These oversights could be useful. I remember a few times first 
recognizing a student who turned out to be very strong when s/he came for help in 
understanding one of Lang’s faulty arguments. 

Lang was driven to publish. In addition to his own writing, he saw to the 
publication of at least two books which were not his own, namely Class Field Theory 
by Artin and me, and the Collected Papers of Emil Artin. He should really have 
been included among the authors of the former, for the main part of it is essentially 
Serge’s rewriting of his own notes from the 1951-52 Princeton seminar, and the 
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book’s publication would never have happened without Serge’s persistent prodding 
and attending to the details. In the case of Artin’s papers, edited by Serge Lang 
and John T. Tate, it was Lang who did all the work in collecting the material and 
preparing it for publication. He assigned me the trivial task of writing a paragraph 
on Artin’s conjectures to justify my sharing the editorship. In the first printing, 
Addison-Wesley made the mistake of including Lang’s and my names on the cover, 
below Artin’s. Lang hit the ceiling, insisting that the whole printing be redone. 
Fortunately a solution was found. Our names are masked by a decorative strip which 
was added to the cover, although they are discernible if you know they are there. 

To touch on some of Serge’s personal characteristics: He did things fast. He typed 
fast, ate fast, drove fast, walked fast. I still remember trying to keep up with him on 
walks across campus. On a wintry day he used only earmuffs and gloves (no coat) 
to keep warm. He seemed to have a hummingbird-like metabolism. Besides being 
fast, he did not waste time. He hated small talk. 

Serge cared about quality. His possessions and gifts were chosen thoughtfully. 
He had impeccable taste and sought out fine things. I think of his leather jacket, his 
gloves, his silk scarf and clothing generally, but also of his appliances, his collection 
of rugs, and his furniture. In the summer of 1958, after a year in Paris, we each 
went through Copenhagen to purchase Danish furniture. Serge chose from the best 
collections, I from the mid-range. 

Except for an occasional sip of a vintage wine, Lang did not drink, but he insisted 
on supplying a fine cognac for Yale’s (and often Harvard’s as well) winter holiday 
party eggnog. The batter for the crepes suzettes he served was beaten by hand with 
a whisk, never with a mechanical mixer. 

It is hard to imagine that Serge’s many mathematical activities left time for 
much else, but he was an accomplished musician, playing piano and lute, and 
in later years he spent much time and energy exposing cases of what he saw as 
scientific, editorial or bureaucratic irresponsibility, by compiling and distributing, 
at considerable personal expense for photocopying and postage, collections* of 
relevant original documents, in his aim to create transparency and promote clear 
understanding of the facts of a case. He also supported many people and causes he 
found to be worthy, in ways ranging from enthusiastic encouragement to assistance 
in funding. 

In Serge’s battles with the establishment, his positions were almost always 
fundamentally sound, though the extremes to which he went and the vehemence 
with which he pursued his points of view may have prevented some from realizing 
this. Serge could be difficult. To him things were black or white. To compromise 
was not his way. 

Serge Lang devoted his life to advancing mathematics, to teaching, and to 
fighting for honesty in science and politics. 


3These collections were known as “files” to the people on Lang’s extensive “cc lists.” 
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Abstract Let Yxy — Xx be a Galois covering of smooth curves over a field 
of characteristic 0, with Galois group G. We assume K is the fraction field of a 
discrete valuation ring R with residue characteristic p. Assuming p”} G and the p- 
Sylow subgroup of G is normal, we consider the possible reductions of the covering 
modulo p. In our main theorem we show the existence, after base change, of a 
twisted curve V — Spec(R), a group scheme G —> ¥ and acovering Y > ¥ 
extending Yx — Xx, with Y a stable curve, such that Y is a G-torsor. 

In case p?|G counterexamples to the analogous statement are given; in the 
appendix a strong counterexample is given, where a non-free effective action of 
a’, on a smooth |-dimensional formal group is shown to lift to characteristic 0. 
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1 Introduction 


1.1 Reduction of coverings of degree divisible by p 


Let R be a discrete valuation ring of mixed characteristics, with spectrum 
S = Spec R. Denote the generic point 7 with fraction field K, and the special 
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point s with residue field k of characteristic p > 0. Consider a generically smooth, 
stable pointed curve Y — S with an action of a finite group G of order divisible 
by p. Denote X = Y/G. We assume that G acts freely on the complement of the 
marked points in Y,,; it then follows that G respects the branches of all nodes of Y,. 

In situations where the order of G is prime to the residue characteristic, the 
reduced covering Y, > X, is an admissible G-covering, and a nice complete moduli 
space of admissible G-coverings exists. An extensive literature exists describing 
that situation, see e.g., [H-M, Mo, Ek, W, 8-C-V]. However, in our case where the 
residue characteristic divides the order of G, interesting phenomena occur (see e.g., 
[S8-Oo]). The situation was studied by a number of people; we will concern ourselves 
with results of Raynaud [Ra] and, in a less direct way, Henrio [He] and more 
recently Maugeais [Ma]. Related work of Saidi [Sal,Sa2,Sa3], Wewers and Bouw 
[W1, W2, W3, Bo, B-W1, B-W2], Romagny [Ro] and others provides additional 
inspiration. In [S-O-V2, Section 5] the curve Y is replaced by something which 
could be much more singular, and therefore the results are somewhat orthogonal to 
the situation here. 

Thus, in our case where p | |G|, the covering Y — X is no longer generically 
étale on each fiber. It is natural to consider some sort of group-scheme degeneration 
G — X of G, in such a way that Y might be considered something like an 
admissible G-covering. In our main theorem we show this is the case under 
appropriate assumptions: 


Theorem 3.2.2. Assume p* } |G| and the p-Sylow subgroup of G is normal. 
There exist 


(1) atwisted curve ¥ > X, 

(2) a finite flat group scheme G > 1, 

(3) ahomomorphism Gy — G which is an isomorphism on “x, 
(4) alifting Y — VY of Y > X, and 

(5) an action of G on Y through which the action of G factors, 


such that Y — is a principal G-bundle. 
The formation of G commutes with any flat and quasi-finite base change R C R’. 


It is important to note that, unlike the characteristic 0 case, X is not a stable 
pointed curve in general. 


1.2 Background 


Raynaud ([Ra], Proposition 1.2.1) considered such a degeneration locally at the 
generic points of the irreducible components of X;, in the special case where 
|G| = p; our first goal, see Theorem 3.1.1 below, is to work out its extension to the 
smooth locus of X, and slightly more general groups, where as above p” + |G| and 
the p-Sylow subgroup of G is normal. The case where p* | |G| remains a question 
which I find very interesting. See Example 2.1.7 and the appendix by J. Lubin for a 
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negative result in general, the discussion of question 2.1.5 for positive results in the 
literature, and Remark 2.1.8 for a positive result for small ramification. 

One still needs to understand the structure of Y — X at the nodes of X, and 
Y,. Henrio, working p-adic analytically, derived algebraic data along X,, involving 
numerical combinatorial invariants and differential forms, which in some sense 
classify Y; — X,. Our second goal in this note is to present a different approach to 
such degenerations at a node, modeled on twisted curves, i.e. curves with algebraic 
stack structures. Borrowing a metaphor from A. Ogus, these twisted curves have 
served well in the past as a sort of “magic powder” one sprinkles over the “bad 
locus” of certain structures, which brings about a hidden good property. The point 
here is that, just as in [&-C-V], the introduction of twisted curves allows one 
to replace Y — X by something that is actually a principal bundle. Unlike the 
case of residue characteristics prime to |G|, the twisted curves will in general be 
Artin stacks rather than Deligne-Mumford stacks. See [Ol], [8-O-V2]. 


1.3. Towards a proper moduli space 


The main theorem should be thought of as a first step in constructing a nice proper 
moduli space of degenerate coverings in mixed characteristics - it gives a special 
case of the valuative criterion for properness. In joint work with M. Romagny 
we plan to complete this task. Foundations have only recently been developed in 
[S8-O-V, 8-O-V2]. 


1.4 Brief introduction to twisted curves 


A twisted pointed curve over a base scheme S is a diagram as follows: 


ye C+ € 


| | 


Here we follow [8-O-V2, Section 2]: 


¢ C > S isa usual n-pointed nodal curve, with sections ze, i eee 
e C is an algebraic stack with finite diagonal having C as its coarse moduli space; 
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* =¢ C C are its markings, each of which a gerbe banded by some [L,, Over 
MP 8: 

¢ C — C isan isomorphism away from nodes and markings of C; 

* at a marking of C, where the strict henselization C*" is described by 
(Spec, Os[x])** and YO is the vanishing locus of x, the twisted curve C is 
described as 

[ (Specs Os[u])"/H,, J. 
where j,, acts on u via the standard character and uw" = x, and ue is the quotient 
of the vanishing locus of u; 


* at anode of C, where the strict henselization C™ is described by (Specs Os 
Lx, y]/ (xy — f))™ with f € (Os)*", the twisted curve C is described as 


[ (Specs Os [uv / (uv — 8)" / ay | 
for some r, where jz, acts via 


(u,v) (¢,u, €'v), 


and u” = x,v’ = yandg’ = f. 


Of course the description on the level of strict henselization descends to some 
étale neighborhoods. In case p divides r; or r, the twisted curve C is not a 
Deligne—Mumford stack, and it is a little bit of a miracle, following from [S8-O-V2, 
Proposition 2.3], that one can use such a nice description locally in the étale 
topology (or on strict henselizations) rather than the f.p.p.f. topology. The reader 
is warned that transition isomorphisms between the étale local charts are in general 
not given in étale neighborhoods but rather in smooth or f.p.p.f. charts. 

Near a marking re, the twisted curve is determined, uniquely up to a unique 
isomorphism, by the choice of r;. In fact locally in the Zariski topology of C we 


can write C = 4/(C, ay, see [X-G-V] for notation and proof. Near a node, C is 
still uniquely determined by r, but not up to a unique isomorphism - in fact Autc C 
has a factor yx, for each such twisted node of index r, see [X-C-V]. 
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2 Extensions of group-schemes and their actions 
in dimension 1 and 2 


2.1 Raynaud’s group-scheme 


Raynaud (see [Ra], Proposition 1.2.1, see also Romagny, [Ro]) considers the 
following construction: let U be integral and let V/U be a finite flat G-invariant 
morphism of schemes, with G finite. Assume that the action of G on the generic 
fiber of V/U is faithful. We can view this as an action of the constant group scheme 
Gy on V, and we consider the schematic image G of the associated homomorphism 
of group-schemes 

Gu — Auty V. 


Since, by definition, the image Gy — U is finite, we have that G — U is finite 
as well. The scheme G — U can also be recovered as the closure of the image of 
the generic fiber of Gy, which is, by the faithfulness assumption, a subscheme of 
Auty V. By definition G acts faithfully on V. 


Definition 2.1.1. We call the scheme G the effective model of G acting on V/U. 


Note that a priori we do not know that G is a group-scheme. It is however 
automatically a flat group-scheme if U is the spectrum of a Dedekind domain. This 
follows because, in that case, the image of G xy G — AutyV is also flat, and 
therefore must coincide with G. 

Also note that, if s is a closed point of U whose residue characteristic is prime 
to the order of G, then the fiber of G over s is simply G. So this effective model is 
only of interest when the residue characteristic divides |G]. 

The following is a result of Raynaud, see [Ra], Proposition 1.2.1. The statement 
here is slightly extended as Raynaud assumes |G| = p: 


Proposition 2.1.2. Let U be the spectrum of a discrete valuation ring, with special 
point s of residue characteristic p and generic point yn. Let V — U be a finite 
and flat morphism, and assume that the fiber V, of V over s is reduced (but not 
assuming geometrically reduced). Assume given a finite group G, with normal p- 
Sylow subgroup, such that p? + |G|, and an action of G on V such that V > U is 
G-invariant, and such that the generic fiber V, — {n} is a principal homogeneous 
space. Let G — U be the effective model of G acting on V/U. 
Then V/U is a principal bundle under the action of G > U. 
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Remark 2.1.3. An analogous construction in a wider array of cases is given in 
Romagny’s [Rol, Theorem A]. Romagny does not aim to construct a principal 
bundle; on the other hand he shows that an effective model for an action exists 
even if V/U and G/U are not finite, under very mild hypotheses. 


Proof. As in Raynaud’s argument, it suffices to show that the stabilizer of the 
diagonal of V; xy Vs; inside the group-scheme V, xy G is trivial. Since G acts 
transitively on the closed points ¢; of V; sending the stabilizer on ¢; to that over f;, 
it is enough to show that one of these stabilizers, say over t € Vs, is trivial. But this 
stabilizer P is a group-scheme over the residue field k(t) with degree deg P | p, 
and if nontrivial it is of degree exactly p. In such a case it must coincide with the 
pullback of the unique p-Sylow group-subscheme of G; therefore that p-sylow acts 
trivially, contradicting the fact that G acts effectively. a 


Remark 2.1.4. In case the inertia group is not normal, Raynaud passes to an 
auxiliary cover, which encodes much of the behavior of V > U. 


Question 2.1.5. What can one say about the action of G on V in case the order 
of G (and the degree of V — U) is divisible by p*, but the inertia group is still 
normal? Specifically, what happens if |G| = p?? 


In the latter case, consider a subgroup P C G of order p. It can be argued, as in 
Raynaud’s proof, that the effective model P — U of P acts freely on V, and thus 
V — V/P isa principal P-bundle. Similarly, if Q is the effective model of G/P 
acting on V/P, then V/ P — U isa principal Q-bundle. At the same time, we have 
an action of the effective model G of G on V/P, but it is not necessarily the case 
that G/P — Q is an isomorphism. 

While the statement of Question 2.1.5 is somewhat vague, two definite answers 
can already be given. First, if one concentrates on effective models of the action in 
the sense of Romagny, a great deal can be said. The recent work of Tossici [To1,To2] 
concentrates on the case where Gx = Z/p*Z and Oy contains a primitive root of 
unity of order p?. The paper [Tol] describes explicitly the possible models G of 
Gx; in [To2] an explicit description of the effective model of Gx acting on V is 
provided. I think it would be of interest to see if results as in Theorems 3.1.1 and 
3.2.2 can be obtained for more general effective models such as these. 

Second, in general no model of G will act freely on V. This is the case even for 
some of the prettiest actions one can consider. This makes giving a complete answer 
to the previous Question 2.1.5 tricky, and underscores the importance of work such 
as Tossici’s. 

As probably the simplest example, consider an action of Go = (@ i = 
Spec k[a, b]/(a’, b”) on k(t). Examples of liftings of a non-free action of the type 


tt tt+ad f(t)b 


Raynaud’s group-scheme and reduction of coverings 7 


for any residue characteristic have been written down by Romagny (personal 
communication) and Saidi (see [Sa4]). The case of 


tt tt+a+t?b (1) 


is particularly appealing, as it involves torsion and endomorphisms of a formal 
group. I therefore ask 


Question 2.1.6. Can one lift the action (1) to characteristic 0? 


A formal positive answer in arbitrary residue characteristics is given by Jonathan 
Lubin in the appendix. Here I discuss explicitly the case of residue characteristic 
2, where this action can be obtained as a reduction of an action of (Z/2Z)* ona 
smooth curve. I concentrate on the local picture (making it global is not difficult): 


Example 2.1.7. Let R = Zs[./2]. Consider the group-scheme Y/R defined by 
te! = tte + V2tt' 


This is an additive reduction of the multiplicative group. The reduction of the 
subgroup {12 1s given as 


Spec R[a] / (a(a+ V2)), 
reducing to @. It acts on Y by translation via the addition law as above: 
tre tt+at v2at. 


The reduction of the action of Z/2Z by inversion is the same group-scheme, 
again reducing to @2, which we write as 


Spec Rib] / (b (b+ V2). 


This time the action is given by 


bt? 
haia/Je 


Since 2-torsion is fixed by inversion, these actions commute. Explicitly, the 
action of the product is given by 


i (1+ 26) 1 = 


bt? 
1+ J/2t 


bt? 
+ Via ((1 + vio) - —*—), 


tre a + (1 + v26)0 - 
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The reduction modulo V2 is given by 
tettat?b, 


as required. 


Remark 2.1.8. Raynaud’s arguments do work when p’| |G| if the p-Sylow group- 
scheme of G has only étale and cyclotomic Jordan—H6lder factors. This is because, 
in that case, there are no nonconstant group subschemes in the reduction. In 
particular this works whenever the absolute ramification index over Z, is < p. 


2.2. Extension from codimension I to codimension 2 


Consider now the case where U a Gorenstein noetherian scheme, dim U = 2, and 
V/U finite flat and G-invariant as above. Consider the S2-saturation G’ > G of the 
effective model G of the G action on V/U. In Section 6.1.2 of [Va] Vasconcelos 
considers such saturation (S2-ification in his terminology). His Proposition 6.21 on 
page 318 applies in our situation, and gives the existence and a characterization of 
the S2-saturation. We have 


Lemma 2.2.1. If G’ > U is flat, then G' is a group-scheme acting on V. 


Proof. We claim that the rational map G’ xy G’ --> G’ induced by multiplication 
in AutyV is everywhere defined. Indeed the graph of this map is finite over 
G’ xy G’ and isomorphic to it over the locus where G’ — G is an isomorphism, 
whose complement has codimension > 2. Now G’ is S2 and of dimension 2, hence 
Cohen—Macaulay. Pulling back to G’ the flat Cohen—Macaulay G’ — U we get 
that G’ xy G’ is Cohen—Macaulay, in particular S2. This implies that the graph of 
G' xy G’ --> G’ is isomorphic to G’ xy G’, and therefore the map is regular. The 
same works for the map defined by the inverse in Auty V. This makes G’ a group- 
scheme, and the map G’ — Auty V into a group-homomorphism. Oo 


This applies, in particular, when U is regular: 


Lemma 2.2.2. If U is regular, the S2-saturation G’ of the effective model G is a 
finite flat group-scheme acting on V. 


Proof. Again G’, being 2-dimensional and S2, is Cohen—Macaulay, and being finite 
over the nonsingular scheme U, it is finite and flat over U (indeed its structure sheaf, 
being saturated, is locally free over the nonsingular 2-dimensional scheme U). The 
result follows from Lemma 2.2.1. oO 


When the action on the generic fiber is free, we have more: 


Proposition 2.2.3. Let U be a Cohen—Macaulay integral scheme with dim U = 2. 
Let V — U bea G invariant, finite, flat and Cohen—Macaulay morphism, and 
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assume the action of G on the generic fiber is free. Let G — U be the effective 
model of the action. Assume that for every codimension —1 point &, the action of the 
fiber Gz on V~ is free. 

Then 


(1) G > U isa flat group-scheme, and 
(2) The action of G on V is free. 


Note that, by Raynaud’s result 2.1.2, the assumptions hold when U = V/G is 
local of mixed characteristics (0, p), the fibers V; are reduced, the p-Sylow of G is 
normal and p” + |G|. 


Proof. Consider the S2-saturation G’ of G. Since V — U is flat and Cohen— 
Macaulay, the same is true for V xy V > V and for G’ xy V > G’. Since G’ and V 
are Cohen—Macaulay, we have that V xy V and G’ xy V are Cohen—Macaulay, hence 
S2, as well. The morphism G’ xy V — V xy V induced by the action G’ > Auty V 
is finite birational and restricts to an isomorphism in codimension |. By the S2 
property it is an isomorphism. In particular we have that G’ xy V — V is flat, and 
since V — U is faithfully flat we have that G’ — U is flat. By Lemma 2.2.1 we 
have that G’ > U isa finite flat group-scheme acting on V, and the isomorphism 
G’ xy V > V xy V shows that the action is free, in particular G’ > G is an 
isomorphism. Oo 


3 Curves 


3.1. The smooth locus 


The main case of interest for us is the following: 

Let R be a complete discrete valuation ring of mixed characteristic, with fraction 
field K of characteristic 0, residue field & of characteristic p > 0, and spectrum S. 
Assume Y — S is a stable pointed curve with smooth generic fiber, G a finite group 
acting on Y over S, and denote 


X=Y/G. 


We assume that the closure of the locus of fixed points of G in Yx forms a disjoint 
union of marked sections of the smooth locus Y;m. Hence for every node y € Y, 
the stabilizer in G of y keeps the branches of Y at y invariant. We denote the 
complement of the closure in Y,, of the fixed locus of the generic fiber by Yen, 
and the image in X by Xen — the so called general locus. 

Note that the morphism Ysm — Xm is flat. 

The propositions above give: 


Theorem 3.1.1. Assume p + |G| and the p-Sylow subgroup of G is normal. 
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There exist 


(1). a finite flat group-scheme Gsm > Xm, 
(2). a homomorphism Gy,,, > Gsm which is an isomorphism over X x, and 
(3). an action of Gsm on Ysm through which the action of G factors, 


such that Ygen — X gen is a principal Gyn-bundle. 
The formation of Gsm commutes with any flat and quasi-finite base change 
eR, 


Proof. Let Gsm — Xsm be the S2-saturation of the effective model of the action of 
G on Ysm. AS Xgm is smooth we can apply Lemma 2.2.2; therefore Gsm — Xm is a 
finite flat group-scheme acting on Ysm, giving (1) and (3). Part(2) applies since over 
K the group G does not degenerate. 

The assumptions on G mean we can apply Proposition 2.1.2, so the action of 
Gsm|X gen ON Yen is free in codimension-1. We can therefore apply Proposition 2.2.3, 
and obtain that Ygen — Xen is a principal bundle. 

The formation of G, clearly commutes with base change when restricted to the 
locus where it acts freely, and also over Xx. As it is flat and $2, its formation also 
commutes with base change across the remaining codimension-2 locus. Oo 


It would be really interesting to see what happens for other groups G. 


3.2. The structure of Y and G over nodes and markings of X 


What can be done about the singular points and markings of X and Y ? It is easy 
to see that even in the case of characteristic 0, the cover Y — X is not a principal 
bundle in general; it is already not a principal bundle at the fixed points of Yx, 
and rarely a principal bundle at the nodes. However, the behavior of Y —> X at 
the nodes is very interesting. My suggested approach here is to follow the method 
of [8-V1, 8-V2, 8-C-V, Ol, 8-O-V2] using twisted curves. Let us first consider the 
cover Y — X itself and investigate its structure from this point of view. 

Consider first anode P € X where étale locally X‘" is described by the equation 
xy = x", with z a uniformizer in S. Similarly, take a node QO € Y over P with 
local equation st = 2”. Say the local degree of Y > X at Q is d, so without loss 
of generality we can write x = s¢ju and y = ¢@v, where jz and v are units on Y°", 
Comparing the Cartier divisors of x, y,s,¢ and x on Y*", we get that m = dn, and 
jv = 1. Note that, since G acts transitively on the points of Y lying over P € X, 
the degree d is independent of the choice of QO, and we may denote it dp, to indicate 
its dependence on P. 

Consider now the twisted curve V having index dp at each node P. Recall from 
above that it has local description 


[ (Specs Os luv] / (w= 2"))" / pa J. 
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Write Z = (Spec, Os [u, v] / (uv — m"))S", We stress again that up to a non-unique 
isomorphism “ does not depend on the choice of local coordinates. 


Lemma 3.2.1. There is a lifting, unique up to a unique isomorphism, of Y > X to 
a finite flat Cohen—Macaulay morphism Y > %. 


Proof. Recall that the coordinate s on Y* is related to x viax = s@j with pa unit. 
Consider the z;-cover P — Y*" given by 


P = Spec Oys|[w]/(w4 — p), 


using the same unit 4, where fw, acts via w +> Caw. Define a morphism P > Z 
via u = sw and v = t/w. This morphism is clearly equivariant, giving a morphism 
Y" > [Z/pp] = X". Since (sw)? = x = uw? and (t/w)? = y = Vv? this 
lifts the given map Y — X. It is a tedious but straightforward exercise to show 
that the morphism on strict henselization descends to give the required morphism 
Y — &. The uniqueness statement follows from the fact that ¥ is a separated 
stack. To check that Y — -¥ is flat it suffices to show P —> Z flat. This follows 
from the local criterion for flatness: the fiber over u = v = O is given by 
s =t =0,w? =c where c is the constant coefficient of oats =t = 0. Thisisa 
scheme of degree precisely d as required. Since Y and ¥ (or, for that matter, P and 
Z) are Cohen—Macaulay, the morphism is Cohen—Macaulay. Oo 


We now have our main theorem: 


Theorem 3.2.2. Assume p* + |G| and the p-Sylow subgroup of G is normal. 
There exist 


(1) a twisted curve X > X, 

(2) a finite flat group scheme G > %, 

(3) a homomorphism Gx — G which is an isomorphism on Xx, 
(4) alifting Y > X of Y > X, and 

(5) an action of G on Y through which the action of G factors, 


such that Y > & is a principal G-bundle. 
The formation of G commutes with any flat and quasi-finite base change R C R’. 


Proof. There are two issues we need to resolve here: the construction of G at the 
nodes, and the construction of V and G at the markings. 

First we need to extend G over nodes. We have that Y — 4 is flat and 
Cohen—Macaulay at the nodes; by Theorem 3.1.1 we have that Yen — Xgen iS a 
principal bundle under Gen. By Proposition 2.2.3 the effective model G — 4 of the 
action of Gy on the V-scheme Y is a finite flat group-scheme over +’, and away 
from the markings Y is a principal bundle. 

Next, we deal with the markings: the local picture of Yx — Xx at a marking is 
Ys" — (Spec R[s])°" and X8* = (Spec R[x])** where x = s@, and the stabilizer in 
G of s = 0 on Y is identified with w,, acting vias +» ¢s. We give ¥ the unique 
structure of a twisted curve with index d along this marking; locally around the 
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marking s = 0 we have V" = [(Spec Os[u])*"/1,]. The discussion above shows 
that Yx — “x is a principal G-bundle. Applying Proposition 2.2.3 again we obtain 
that G is a group-scheme and Y is a principal bundle. Oo 


So, in view of the characteristic 0 discussion in [8-C-V], we might call VY > x 
a twisted G-bundle. 

This suggests an approach to lifting covers from characteristic p to characteristic 
0, by breaking it in two stages: (1) lifting group-schemes over -V, and (2) lifting the 
covers. Recent work of Wewers [W3] seems to support such an approach. 


Appendix A. Lifting a non-free action on a formal group 


by Jonathan Lubin 


The Question. In characteristic p > 0, consider the substitution t > a +t+bt?, 
where a? = b? = 0. This clearly defines a group-scheme of rank p”, isomorphic 
to @» X apy, and an action of the groups-cheme on a curve, in this case the affine 
line. Question 2.1.6 asked whether this group-scheme and this action can be lifted 
to characteristic zero, over a suitably ramified extension of Zp. 


The Answer. It’s a partial yes, in that the example presented here shows an action 
not on the affine line but on the formal version of this, the formal spectrum of O[[f]], 
where 9 is the ring of integers in a well-chosen ramified extension of Q,. But if the 
question is whether there is any example of an action of a, x a@, on a genuine 
algebraic curve in characteristic zero, then I must plead ignorance. 

In general, if R is a ring and f and g are power series in one variable over R, 
then it makes no sense to compose the series, f og, unless g has zero constant term. 
Yet, there are situations where R has a suitable complete topology, when f o g can 
make sense even when g(0) ¥ 0. Let us detail one fairly general such situation: 

If (0,m) is a complete local ring, then on the category of complete local 
o-algebras (R, M) we define a group functor denoted S$ or %,, such that B(R) 
is the set of power series 


fO=Dlet! € Ri 


jz0 


for which co € M and c; ¢ M. Our desire is that 8,(R) should be a group 
under composition of power series, and indeed the condition on co guarantees that 
composition will be well defined, while the condition on c; guarantees that the series 
will have an inverse in 8(R). One sees now that if « is the characteristic- p field of 
definition in Question 2.1.6, and if R is the local «-algebra xa, b]/(a?, b”), then 
the series a + t + bt? is an element of B,(R). The relation 


(at+t+bt?)o(at+t+b't?)=(ata)t+t+(b4+bd'yt? 


shows that the group-scheme that’s being described is finite and isomorphic to 
Ap X Ap. 
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The Method. We take a formal group F of finite height that has a subgroup of 
order p as well as a group of automorphisms of order p. Now, finite groups of 
automorphisms of a formal group of finite height are always étale, but by taking 
a slight blowup of F, we convert the automorphism subgroup to a local group- 
scheme, without going so far as to make the above group of torsion points of F 
étale as well. Then, allowing ourselves a slight abuse of language, our desired lifting 
consists of all substitutions 


tre attt[b]r(t), 


where a is a torsion point of the blow-up of F’, and 1 + b is a p-th root of 1. In 
the displayed formula, F’ is the blown-up version of F’, the tilde over the plus-sign 
indicates addition with respect to F’, and as usual, [b]-/(t) is the endomorphism 
whose first-degree term is bt. I suppose that very confident people may be able to 
look at the preceding explanation and say, Of Course, No Problem, End of Story. 
But I’m not so confident, and the rest of this paper is devoted to filling in the gaps 
and making sure, to my own satisfaction at least, that everything is on the up and 
up. To those confident readers, everything from here on may thus be unnecessary, 
though the summary 1-7 at the end of this note may be an aid to flagging assurance. 


A.1 Some algebra 


Let ¢ = ¢, be a primitive p-th root of 1 in an algebraic extension of Q,, and let 
o = Z,[¢]. Let also = ¢ — 1, a prime element of 0, and let k be the fraction field 
of o. In the ring o[T]/(T?” — 1), let us call I the image of 7, and let us consider 
A= fot Then the minimal polynomial for A is 


-—1 
Porp-l oe P(p ppg eg P(P Vp2 4 P T, (*) 
4 


TP + = 
2n? Qn P-2 qe! 


in which the coefficient of T is a unit in 0 congruent to —1 modulo z. Let us call 
B the ring o[A]; we need to establish a few facts about it. I will use capital Greek 
letters for elements of B, lower case Greek letters for elements of o. 


Lemma A.1.1. The ring B is isomorphic to 0 ® 0 ® --: ® 0, with p factors. In B, 
every element © satisfies the condition that 0? — © € mB. 


Proof. The minimal polynomial for A, described above, is T? — T modulo z. By 
Hensel’s lemma it splits into dinstinct linear factors over the complete local ring 0, 
so that the first part of the statement is verified. Since each element 6 € o has the 
property that 6” — B € so, the corresponding property holds for elements of B as 
well. It may be of interest to note that this is not true of the subring o[I’] of B. 
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A.2. Endomorphisms of the fundamental formal group 


We start with the polynomial f(t) = at +t? € ol[t]], which has associated to it 
a unique formal group F(x, y) € o[[x, y]] for which f € End,(F), as proved in 
[LT]. The following is hardly surprising: 


Lemma A.2.1. For each © € B, there is a unique series [©]r(t) € B|[t]] such 
that [©]; (0) = © and f 0[@]r = [©] Fr ° f; this series is an element of Endg(F). 
In particular for © = x we have |1|r = f. 


This may be proved by using either of the halves of Lemma A.1.1; if one wishes 
to use the fact that ©? — @ is always in zB, then the proof of the first Lemma in 
[LT] goes through word-for-word. The statement [z]- = f follows by uniqueness. 

The endomorphism ring Endg(F’) contains in particular the series [A]- and 
[[']r; the p-fold iterate of the latter series is the identity series t. And since 
l = 1+ 2A, our periodic series [[']-(t) may also be written as 


F(t, ([t]r 0[A]r) ()). 


If B is an element of ker([z]-), then the series tg(¢) = F(t, 6) commutes with both 


lee) = F(t, [x]r@) 


and 
(]r@) = FC, (Ale olz]r)@). 


If only B=o[A] had not been an étale o-algebra, we could have taken 
ker([z]-) x Spec(B) as our desired lifting of a, x a». After all, the points of 
ker([z]r) are the 6’s mentioned above, and the points of Spec(B) are essentially 
the p-th roots of unity &, and the substitution 


tt> F (B,[E]r()) 


would be our lifting of the substitution mentioned in the introduction. There is the 
additional problem that in case p = 2, F is of height one and so ker[z] is not a 
lifting of w,, but the étaleness of the other factor is a much bigger obstacle. 

Because of the form of f(t) = [z]r(t) = at + t?, not only F but also all the 
B-endomorphisms [©]; have the property that the only nonzero terms are in degrees 
congruent to | modulo p — 1. Any such series can be written, that is, in the form 
po H,; where each H; is a form or monomial of degree 1 + j(p — 1). For want 
of a better term, I will call any series with this last property (p — 1)-lacunary. 

Now I want to let D be any complete local o-algebra in which z is no longer 
indecomposable, x = Aj where both A and yw are nonunits. Minimally, one may 
take A = pp = ./x and D = o[./z]. Or we may let be the ring of integers in any 
properly ramified algebraic extension K of k, and A any element of K with valuation 
0 < v(A) < v(z) = 1. Or, generically, we can take D = o[[A, 2/A]], a ring that can 
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be described alternatively as o[[A, jz]]/(Ajz — 2) or as the set of all doubly infinite 
Laurent series }* jen Oj A/ in the indeterminate 4 and with coefficients a; € 0 
satisfying the additional condition that 7 + v(@;) = 0, where v is the (additive) 
valuation on o and k normalized so that v(r) = 1. 

If G is a (p — 1)-lacunary series in one or more variables, I will call the 
A-blowup of G, denoted by G™), the series formed from G in the following way: 
ifG = bare, H;, each H; being homogeneous of degree 1 + j(p — 1), then 
GY = Y j>04 Aj. 

When we apply the above operation to F and its endomorphisms, here is what 
happens: F“) becomes a formal group whose reduction modulo the maximal ideal 
of 9 is just the additive formal group x + y. The maps End,(F) — Endg(F™) 
and Endg(F) > Endgg,9(F™) that take g(t) to g(t) are injections. For any 
© € B, I will write [0] for [O]-a) = ((O]r); then since [x] (t) = at + 
At? = A(ut +t”), the new formal group F” has at least one nontrivial finite 
subgroup, namely the set of roots of wt + ¢?, under the group law furnished by 
F), and they certainly are the geometric points of Spec (O[f]/(ut + t”)), but this 
is not the kernel of [7], since the standard construction of kernel in that case 
leads to something that’s not flat. Rather, if we call g(t) = ut + ¢?, then the finite 
group-scheme we’re talking about is the kernel of g : F® => F?°), 

Seeing just how a group scheme lifting a, acts on F “) is a little trickier and 
more unusual. Our aim is to show that the automorphism [I"]™(t) of F lies in 
O[A’][[t]], where A’ = 1A has the D-minimal polynomial 


A —1)A? —1)aP-? 7 te 
re Pape 4 PP ee TP oa PP ae r+ T (**) 
IU IU IU 4 
-1 —1 
= pe 4 Ppp 4 PP : )pr-2 4.4 PY Yr 4 P_f, 
2 2p? bP 


Note that this polynomial is congruent to 7” modulo the maximal ideal MN of O. 

Now recall that 7 = 1+ Az, so that the series [J"](t), which is periodic of period 
p with respect to substitution of series, whether we are talking about automorphisms 
of the original F or of the blown-up F™), can be written 


]@) = F(t, (Alolx)@). 


Since every element of B is an o-linear combination of {1, A,..., A?~!}, we may 
write 
[A]e(t) = At+ >> Cjt/?-)+ © Biel, 
j21 
where, as remarked, each coefficient C; is an o-linear combination of the powers of 
A, up to A?~!, It follows that [A], the corresponding endomorphism of F,, has 
the form 
[A]ew(t) = At+ S° CAPT € Oley], 


j21 
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where the C;’s are the same in both displayed formulas. Now, what of [A]™ o 
[x] = [A] (art + At?)? Making the indicated substitution gives 


A (xt + At?) + S°CjAl (at + AtPi@-Y41 


j21 


= A’ (pt +t?) + > Carrhae 4 PF P-D+1 


But now because C; € B = o[A], we also have C;A/?t! € AD[A’] = ADA], 
since the j’s all are at least 1. This shows that [2 A]”(t) is a power series with 
coefficients in O[A’], and indeed, modulo Mt, this series is just A’t?. Finally, when 
we add this series and the series ¢ by means of the formal group F” (x, y) = 
x+y mod M, the result, namely [7] (t), has coefficients in B’ = O[A’], and is 
congruent modulo 9% to t + A’t?. One more remark is necessary, the obvious one 
that if wa + a? = 0, then [Ax]™ (aw) = 0 and [[]™ (a) = a. 
In summary, this is what we now have: 


(1) The ring o is Z,[¢], where ¢ = ¢, is a primitive p-th root of unity, and we use 
the prime element x = €—1. 

(2) The ring is any suitably ramified extension of 0, the minimal example being 
D = o[./z]. This 9 is the ring over which our liftings and action are defined, 
and we identify in it elements A, € O withAy = z. 

(3) The formal group F over o has wt + t” as an endomorphism and thus has o as 

its ring of “absolute” endomorphisms (over the ring of integers of any algebraic 

extension field of the fraction field of 0). Allowing for abuse of language, there 
is a unique o-subgroup-scheme of F of rank p, namely ker[z]- = Spec(A), 
where A = o[[e]]/ ([7]r(¢)). 

The finite o-algebra B is o[A], where the minimal polynomial for A over o is 

given in formula (*). Algebraically, B is o®?, and when we call 

Ir = 1+a/A € B, we have '”? = 1. The scheme Spec(B) is a finite 

étale group-scheme of order p; the element I” € B is a generic p-th root of 

unity, and the operation of the étale group-scheme on the formal-affine line is 

t > [P]e(t) = F (t, (] 0 [AD(). 

We use A € 9 to forma sort of blowup of F,, which we call F” and which is 

described on the preceding page. This formal group has the subgroup-scheme 

Spec(A’), where A’ = O[ft]|/(wt + ¢?), and this group-scheme acts on the 

formal-affine line by the substitution ¢ — F(A)(a,t) when a is any root of 

pt +t?. 

(6) We define A’ = AA € B @, 9, and note that A’ has the minimal polynomial 

over D given by (**) on the preceding page. Call B’ = O[A’]. The periodic 

power series [I"]™ (1), originally defined to be in B @, OJ[¢]], actually is in 

B'|[t]] and as an element of this ring, it becomes ¢ + A’t? in B’ @p O/M|[e]]. 

Since the series ["]™(t) and the F“(a,t) mentioned in (5) commute, we 

do indeed have a finite group-scheme, namely Spec(A’ ® B’), acting on the 

formal-affine line in such a way that over 0/9, the action ist H+ a+r+A’t?. 


(4 


wm 


(5 


wm 


(7 


wa 
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Abstract The modular degree and congruence number are two fundamental 
invariants of an elliptic curve over the rational field. Frey and Miiller have asked 
whether these invariants coincide. We find that the question has a negative answer, 
and show that in the counterexamples, multiplicity one (defined below) does not 
hold. At the same time, we prove a theorem about the relation between the two 
invariants: the modular degree divides the congruence number, and the ratio is 
divisible only by primes whose squares divide the conductor of the elliptic curve. 
We discuss the ratio even in the case where the square of a prime does divide 
the conductor, and we study analogues of the two invariants for modular abelian 
varieties of arbitrary dimension. 
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1 Introduction 


Let E be an elliptic curve over Q. By [BCDTO1], we may view E as an abelian 
variety quotient over Q of the modular Jacobian Jo(N), where N is the conductor 
of E. We assume that the kernel of the map Jo(N) — E is connected, i.e., that E 
is an optimal quotient of Jo(N) (this can always be done by replacing E by an 
isogenous curve if needed). The modular degree m, is the degree of the composite 
map X)(NV) — Jo(N) — E, where we map Xo(N) to Jo(V) by sending P € 
X(N) to [P] — [co] € Jo(N). 

Let fe = )oanq" € S2(To(N),C) be the newform attached to E. The con- 
gruence number r; of E is the largest integer such that there is an element g = 
> b,g" € So(To(N)) with integer Fourier coefficients b, that is orthogonal to fr 
with respect to the Petersson inner product, and congruent to fg modulo rz; (i.e., 
a, = b, (mod r;) for all n). 

Section 2 is about relations between r; and m,. For example, m, | rz. In [FM99, 
Q. 4.4], Frey and Miiller asked whether r, = m,;. We give examples in which 
rz # mg, and show that in these examples, there is a maximal ideal m of the 
Hecke algebra T, such that Jo(V)[m] has dimension more than two over T/m (this 
is the failure of multiplicity one alluded to above). We then conjecture that for any 
prime p, ord,(rz/mz) < Sord,(N), and prove this conjecture when ord,(N) < 1. 

In Section 3, we consider analogs of the modular degree and the congruence 
number for certain modular abelian varieties that are not necessarily elliptic curves; 
these include optimal quotients of Jo(N) and J; (NV) of any dimension associated to 
newforms. Section 3 may be read independently of Section 2. In Sections 4 and 5 we 
prove the main theorem of this paper (Theorem 3.6), and also give some examples 
of failure of what we call multiplicity one for differentials (see Definition 5.13). 
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G. Frey, H. W. Lenstra, Jr. and B. Noohi for discussions and advice regarding this paper. We 
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concerning a number of technical facts that are inputs to our arguments. The first author is also 
grateful to the Max-Planck-Institut fiir Mathematik for its hospitality during a visit when he partly 
worked on this paper. 


2 Elliptic curves 


In Section 2.1, we discuss relationships between the modular degree and the 
congruence number of an elliptic curve. In Section 2.2 we recall the notion of 
multiplicity one and give new examples in which it fails. 
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2.1 Modular degree and congruence number 


Let N be a positive integer and let Xo(N) be the modular curve over Q that 
classifies isomorphism classes of elliptic curves with a cyclic subgroup of order NV. 
The Hecke algebra T of level N is the subring of the ring of endomorphisms of 
Jo(N) = Jac(Xo(N)) generated by the Hecke operators T,, for all > 1. Let f 
be a newform of weight 2 for 9(V) with integer Fourier coefficients, and let J be 
kernel of the homomorphism T + Z[...,a,(f),...] that sends T,, to a,. Then the 
quotient E = Jo(N)/I¢Jo(N) is an elliptic curve over Q. We call E the optimal 
quotient associated to f. Composing the embedding Xo(V) — Jo(N) that sends 
oo to 0 with the quotient map Jo(N) — E, we obtain a surjective morphism of 
curves d; : Xo(N) — E. Recall that the modular degree m,; of E is the degree 
of de. 

Let S2(Z) denote the group of cuspforms of weight 2 on I'o(N) with integral 
Fourier coefficients, and if G is a subgroup of S2(Z), let G+ denote the subgroup 
of S2(Z) consisting of cuspforms that are orthogonal to f with respect to the 
Petersson inner product. The congruence number of E (really, that of f) is the 
positive integer r; defined by either of the following equivalent conditions: 


(i) rg is the largest integer r such that there exists g € (Zf)+ with f= 
g modr. 
(ii) 7, is the order of the quotient group eat: 
We say that a prime is a congruence prime for E if it divides the congruence 
number r;,. Congruence primes have been studied by Doi, Hida, Ribet, Mazur and 
others (see, e.g., [Rib&83, §1]), and played an important role in Wiles’s work [Wil95] 
on Fermat’s last theorem. Frey and Mai-Murty have observed that an appropriate 
asymptotic bound on the modular degree is equivalent to the abc-conjecture 
(see [Fre97, p.544] and [Mur99, p.180]). Thus, results that relate congruence primes 
and the modular degree may be of great interest. 


Theorem 2.1. Let E be an elliptic curve over Q of conductor N, with modular 
degree m,; and congruence number r;. Then m; | rg and if ord)(N) < 1, then 
ord, (r;:) = ord, (mz). 


Thus any prime that divides the modular degree of an elliptic curve EF is a 
congruence prime for E, and if p is a congruence prime for E such that p? 
does not divide the conductor of E, then p divides the modular degree of E. The 
divisibility m, | re was first discussed in [Zag85, Th. 3], where it is attributed to the 
second author (Ribet); however in [Zag85] the divisibility was mistakenly written 
in the opposite direction. For some other expositions of the proof that m,; | rz, 
see [AU96, Lem 3.2] and [CK04]. We generalize this divisibility and prove it 
in Theorem 3.6(a). The second part of Theorem 2.1, i.e., that if ord,(V) < 1, 
then ord, (rz) = ord,(m~z), follows from the more general Theorem 3.6(b) below. 
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Table 1 Differing Modular 


Curve m,; fr, Curve m,; Vg Curve Me Trg 
Dee ne enenee 54B1 2 6 O9AlL 4 #12 128Al 4° 32 
Number 
64Al 2 4 108Al 6 18 128Bl 8 32 
72Al 4 8 112Al 8 16 128Cl 4 32 
80Al 4 8 112B1 4 8 128D1 8 32 
88Al 8 112Cl1 8 16 135Al 12 36 
92Bl 6 120A1 8 16 144Al 4 8 
96Al 4 8 124Al 6 12 144B1 8 16 
96BI 4 8 126Al 8 24 


Note that [AU96, Prop. 3.3-3.4] implies the weaker statement that if p + N, then 
ord,(r;) = ord, (mz), since [AU96, Prop. 3.3] implies 


ord, (rz) — ord,(mz) = ord, (#C) — ord, (cz) — ord, (#D), 


and by [AU96, Prop. 3.4], ord,(#C) = 0. Here cz is the Manin constant of EF, which 
is an integer (e.g., see [ARS06]), and C and D are certain groups. 

Frey and Miiller [FM99, Ques. 4.4] asked whether rz = mz, in general. After 
implementing an algorithm to compute r; in Magma [BCP97], we quickly found 
that the answer is no. The counterexamples at conductor N < 144 are given in 
Table |, where the curve is given using the notation of [Cre97]. 

For example, the elliptic curve 54B1, given by the equation y? + xy + y = 
x? —x?4+x—1,hasr; = 6 and m, = 2. To see explicitly that 3 | r;, observe that 
the newform corresponding to E is f = g+q?+q*—3q°—q'+--+ and the newform 
corresponding to X(27) is g = q —2q*—q’+-:-,s0 2(q) + g(q’) appears to be 
congruent to f modulo 3. To prove this congruence, we checked it for 18 Fourier 
coefficients, where the sufficiency of precision to degree 18 was determined using 
[Stu87]. 

It is unclear whether there is a bound on the possible primes p that occur. For 
example, for the curve 242B1 of conductor N = 2: 11? we have 


my; = 24 fr, = 2*-11. 


We propose the following replacement for Question 4.4 of [FM99]: 


Conjecture 2.2. Let E be an optimal elliptic curve of conductor N and p be any 
prime. Then 


Tz 1 
ord, (=) < ord). 


We verified Conjecture 2.2 using Sage [S*09] for every optimal elliptic curve 
quotient of Jo(N), with N < 557. 
If p => 5, then ord,(N) < 2, so a special case of the conjecture is 


ord, (=) <1 for any p > 5. 


Me 
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2.2. Multiplicity one and its failure 


We say that a maximal ideal m of T satisfies multiplicity one if Jo(N)|m] is of 
dimension two over T/m. The reason one calls this “‘multiplicitly one” is that if 
the canonical two-dimensional representation p, over T/m attached to m (e.g., 
see [Rib90, Prop. 5.1]) is irreducible, then Jo(V)[m] is a direct sum of copies of pm 
[Rib90, Thm. 5.2], and a maximal ideal m of T satisfies multiplicity one precisely 
if the multiplicity of pm in this decomposition is one. Even if pm is reducible, the 
definition of multiplicity one given above is relevant (e.g., see [Maz77, Cor. 16.3]). 
The notion of multiplicity one, originally found in Mazur [Maz77], has played an 
important role in several places (e.g., in Wiles’s proof of Fermat’s last theorem: see 
Thm. 2.1 in [Wil95]). 

In [MR91, §13], the authors find examples of failure of multiplicity one in which 
if p is the residue characteristic of m, then p? | N, and pm is modular of level N/ p?. 
Kilford [Kil02] found examples of failure of multiplicity one where N is prime and 
the residue characteristic of m is 2. See also [Wie07] and [KW08] for examples of 
failure of multiplicity one in the | (VV) context. We now give examples of failure of 
multiplicity one where the square of the residue characteristic of m divides the level 
(the residue characteristic is often odd). 


Proposition 2.3. Suppose E is an optimal elliptic curve over Q of conductor N 
and p is a prime such that p | rg; but p + mg. Then there is a maximal ideal m of T 
with residue characteristic p such that dimr/m Jo(N)[m] > 2, i.e., multiplicity one 
fails for m. 


The proposition follows from the more general Proposition 5.9 below. It follows 
from the proposition above that any example in Table | where simultaneously 
a prime divides r; but does not divide m,; provides an example of failure of 
multiplicity one. In such examples, the associated representation p, may or may 
not be irreducible. For example, for the elliptic curve 54B1 and p = 3, we have 
ord3(rz) = 1 but ord3(m;) = 0, so there is a maximal ideal m with residue 
characteristic 3 such that multiplicity one fails for m. The curve 54B1 has rational 
3-torsion, SO Pm 1s reducible. On the other hand, for the elliptic curve 99A1, we have 
ord3(7,) = 1 but ord3(m,;) = O, so again there is a maximal ideal m with residue 
characteristic 3 such that multiplicity one fails for m. Moreover, Jo(99) is isogenous 
to a product of elliptic curves, none of which admit a rational 3-isogeny. Hence pm 
is irreducible. 

The notion of multiplicity one at a maximal ideal m is closely related to 
Gorensteinness of the completion of T at m (e.g., see [Til97]). Kilford [Kil02] 
found examples of failure of Gorensteinness (and multiplicity one) at the prime 2 
for certain prime levels. In the examples as above where multiplicity one fails for 
some maximal ideal, it would be interesting to do computations (e.g., as in [Kil02]) 
to see if the completion of the Hecke algebra at the maximal ideal is Gorenstein 
or not. 
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3 Modular abelian varieties of arbitrary dimension 


For N > 4, let T’ be either To(N) or | (N). Let X be the modular curve over Q 
associated to I’, and let J be the Jacobian of X. Let A and B be abelian subvarieties 
of J such that A+ B = J, AN B is finite, and every endomorphism of J over Q 
preserves A and B. In this section, we generalize the notions of the congruence 
number and the modular degree to subvarieties A as above, and state a theorem 
relating the two numbers, which we prove in Sections 4 and 5. 

We first give a general example of A and B as above. Up to isogeny, J is the 
product of factors J °1) where Ff runs over the set of newforms of level dividing NV, 
taken up to Galois conjugation, and e(f) is the number of divisors of N/N(f), 
where N(/f) is the level of f. Here Jy is the standard abelian subvariety of J 
attached to f by Shimura [Shi94, Thm. 7.14]. Let A’ be the sum of a ) for 
some set of f’s (taken up to Galois conjugation), and let B’ be the sum of all the 
other Jes, Clearly A’ + B’ = J. The J+’s are simple (over Q), hence A’ N B’ 
is finite. In view of the following lemma, A’ and B’ provide an example of A and B 


respectively as above. Note that by End(J) we mean the ring of endomorphisms 
of J defined over Q. 


Lemma 3.1. End(J/) preserves A’ and B’. 


Proof. Suppose End(J) does not preserve A’ (the case of B’ is symmetric). Then 
since the Jy’s are simple, that means that some abelian subvariety J, of A’ is 
isogenous to some abelian subvariety J, of B’, where g # h. Pick a prime @. 
If f is a newform, then let py denote the canonical absolutely irreducible ¢-adic 
representation attached to f. Now Q,@Vi(J f )* is a direct sum of copies of po, fp) as 
o ranges over all embeddings into Q of the field generated by the Fourier coefficients 
of f. Thus the above implies that there are distinct newforms g’ and h’ (of some 
level dividing N) such that pg = py’. Now each py satisfies tr(p(Frob,)) = 
ap(f) for all p + N2&. Thus for all p + N£, we have ap(g’) = a,(h’). By the 
multiplicity one theory (e.g., see [Li75, Cor. 3, pg. 300]), this means that g’ = h’,a 
contradiction. Oo 


We now give a more specific example, which will include the case of elliptic 
curves. Recall that T denotes the Hecke algebra. If f = }>a,(f)q” € S(T) isa 
newformand J = ker(T > Z[...,an(f),...]), then Ay = J/I/J is the newform 
quotient associated to f. It is an abelian variety over Q of dimension equal to the 
degree of the field Q(...,a,(f),...). Let 2 denote the quotient map J —> A. 
If C is an abelian variety, then we denote its dual abelian variety by CY. There is 
a canonical principal polarization 6 : J = JY. Dualizing ¢2, we obtain a closed 
immersion @y : AY. — JY, which when composed with 97! : JY & J gives us an 
injection @; : AY. <> J. One slight complication is that the isomorphism 6 does not 
respect the action of T, because if T is a Hecke operator on J, then on JY it acts as 
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Wy T Wn, where Wy is the Atkin—Lehner involution (see e.g., [DI95, Rem. 10.2.2]). 
However, on the new quotient J"”, the action of the Hecke operators commutes 
with that of Wy, so since the quotient map J — Aj, factors through J", the 
Hecke action on A¥ induced by the embedding AY. — J¥ and the action on AY 
induced by the injection ¢; : AY — J are the same. Hence AY. is isomorphic to 
pi(A¥) as a T-module, and (AY) = Jy (this follows from the characterization 


of J t in [Shi94, Thm. 7.14]). For simplicity, we will often denote ¢, (AY) = Jy by 


just AY. Let @ be the composite map A¥. mar J = Ay; then ¢ is a polarization 
(induced by dual of the polarization of J). Thus AY +1,;J = J and AY NIJ is 
finite. Hence, in view of Lemma 3.1, AY. and IJ provide an example of A and B 
as in the beginning of this section. 

The exponent of a finite group G is the smallest positive integer such that every 
element of G has order dividing n (i.e., such that for all x € G,nx = 0). 


Definition 3.2. The modular exponent 7, of A is the exponent of AM B and the 
modular number 7, of A is its order. 


Note that the definition is symmetric with respect to A and B. In fact, the 
definition depends on both A and B, unlike what the notation may suggest—we 
have suppressed the dependence on B for ease of notation, with the understanding 
that there is a natural choice of B (e.g., this is the case in the examples we gave 
above). If f is a newform, then by the modular exponent/number of A +, we mean 
that of A = A*, with B = J; J. In this situation, since ¢ is a polarization, Na, 18 
a perfect square (e.g., see [ASO5, Lemma 3.14]). When A+ is an elliptic curve, 
is multiplication by the modular degree m;. Hence AN B = ker(@) is (Z/m;Z), 
and so for elliptic curves, the modular exponent is equal to the modular degree and 
the modular number is the square of the modular degree. 

If R is a subring of C, let $2.(R) = S2(T; R) denote the subgroup of S(T; C) 
consisting of cups forms whose Fourier expansions at the cusp oo have coefficients 
in R. There is a T-equivariant bilinear pairing T x S2(Z) — Z given by (ft, g) b& 
a(t(g)), which is perfect by [AU96, Lemma 2.1] (see also [Rib83, Theorem 2.2]). 
The action of T on H,(J,Z) is a faithful representation that embeds T into 
Matoa(Z) = Z24”, But Z is Noetherian, so T is finitely generated over Z, and 
hence so is S(Z). Let T, be the image of T in End(A), and let Tg be the image 
of T in End(B) (since T C End(J/), T preserves A and B). Since A + B = J, the 
natural map T > Ty, @ Tz, is injective, and moreover, its cokernel is finite (since 
AN B is finite). 

Let S4 = Hom(Ty, Z) and Sg = Hom(Tz, Z) be subgroups of S2(Z) obtained 
via the pairing above. Let Ext' = Ext} denote the first Ext functor in the category 
of Z-modules. 


Lemma 3.3. There is a canonical isomorphism of T-modules 


Ext'((T4 @ Tg)/T, Z) = S2(Z)/(S4 + Sz). 
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The groups S7(Z)/(S4 + Sg) and (T4 ® Tzg)/T are isomorphic. 


Proof. Apply the Hom(—, Z) functor to the short exact sequence 
0>T—>Ty OTs > (T4 OTZ)/T> 0 
to obtain a three-term exact sequence 
0 > Hom(Ty @ Tz, Z) > Hom(T, Z) > Ext'((T4 ®Tg)/T,Z) > 0. (1) 


The perfect T-equivariant bilinear pairing T x S.(Z) — Z given by (t,g) b& 
a,(t(g)) transforms (1) into an exact sequence 


0 > S4 ® Sg > S2(Z) > Ext'((T4 @ Tz)/T,Z) > 0 


of T-modules, which proves the first claim in the lemma. Finally note that if G is 
any finite abelian group, then Ext'(G,Z) ~ G as groups, which gives the second 
result of the lemma. Oo 


Definition 3.4. The exponent of either of the isomorphic groups S2(Z)/ 
(S4 + Sg) and (T4 ® Tz)/T is the congruence exponent 7, of A and the order 
of the groups is the congruence number r,. 


Note that this definition is also symmetric with respect to A and B, and again, 
the definition depends on both A and B, unlike what the notation may suggest 
we have suppressed the dependence on B with the implicit understanding that B has 
been chosen (given A). If f is anewform, then by the congruence exponent/number 
of Ay, we mean that of A = AY, with B = J;J. In this situation, Ty = 
T/Iy and Sy = S2(Z)[J¢]. Recall that a subgroup H of an abelian group G 
is said to be saturated (in G) if G/H is torsion-free. Now Hom(T3, Z) is the 
unique saturated Hecke-stable complement of S2(Z)[J ¢] in S2(Z), hence must equal 
S>(Z)[1 ¢]+, where we recall that S2(Z)[J ¢]+ denotes the orthogonal complement of 
S(Z)[J ¢] in S2(Z) with respect to the Petersson inner product. Thus the congruence 
exponent F,, is the exponent of the group 


S2(Z) 


So(Z)[T ¢] + Sx(Z)[T¢]+ ” (2) 


and the congruence number r, F is its order. In particular, our definition of r, ; 
generalizes the definition in Section 2.1 when A; is an elliptic curve. 


Remark 3.5. If R is a subring of C, then S$2(Z) ®z R = S»(R) (see, e.g., the 
discussion in [DI95, §12]). Thus the analog of the group displayed in (2) with Z 
replaced by an algebraic integer ring (or even Z) gives a torsion module whose 
annihilator ideal meets Z in the ideal generated by the congruence exponent. 
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The following generalizes Theorem 2.1: 
Theorem 3.6. Let A and B be as in the first paragraph of Section 3. Then: 
(a) fy | Ta. 
(b) Let = To(N). If p t N, then ordy(F4) = ordy(7i4). If f € So(To(N), ©) is 
a newform, then ord, (74,) = ord, (n4,) whenever p tN. 


We give the proof of part (a) of this theorem in Section 4 and of part (b) in 
Section 5. The two sections may be read independently of each other. 


Remark 3.7. Let f € S2(T,C) be a newform. When A, is an elliptic curve, 
Theorem 3.6 implies that the modular degree divides the congruence number (since 
for an elliptic curve, the modular degree and modular exponent are the same), and 
that 14, | ra, (since for an elliptic curve, the modular number is the square of 
the modular exponent). In general, for a higher dimensional newform quotient, the 
divisibility 71 4, | V4 need not hold. For example, there is a newform of degree 24 
in S>(T%(431)) such that 


na, = (2''- 6947)" + ra, = (2). 6947). 


Note that 431 is prime and mod 2 multiplicity one fails for Jo(431) (see [Kil02]). 


4 Proof of Theorem 3.6(a) 


Since End(J) preserves A and B, we have a map End(J) — End(A) @ End(B); 
moreover, since A + B = J, this map is injective. We have the following 
commutative diagram with exact rows: 


T T 
° ane > TadTez > Me : > 0 
Y 
End(A) @ End(B 
0 —~+ End(J) —~ End(A) 6 End(B) op (A) @ End(B) a 
End(J) 


(3) 
The first two vertical maps are clearly injections, and the rightmost vertical map is 
defined naturally so that the diagram is commutative. Let 


e=(1,0)€T, OTs, 


and let e; and e2 denote the images of e in the groups (T4 ® Tg)/T and (End(A) ® 
End(B))/End(J), respectively. Since AM B is finite (in addition to the fact that 
A+ B = J), the two quotient groups on the right side of (3) are finite, so e; and e2 
have finite order. 
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Lemma 4.1. The element e. € (End(A) @ End(B))/End(J) defined above has 
order ii ,. 


Proof. By the denominator of any g € End(J) ® Q, we mean the smallest positive 
integer n such that ng € End(J). Let m4, a, € End(J) ® Q be projection onto A 
and B, respectively. Let n be the order of e2, so 1 is the denominator of z,, which 
equals the denominator of zr, (since 7, + 7, = 1y;, so that wz; = 1y — 7,4). We 
want to show that 7 is equal to 7,4, the exponent of AN B. 

Let i4 and ig be the embeddings of A and B into J, respectively. We view nz, 
and nz, as morphisms J — A and J — B, respectively. Let g = (n74,n73) € 
Hom(J, A x B); then g o (i4 + ig) = [n]4xg. We have an exact sequence 


iatip 


Cann = ae 4g 25, 
Let A be the image of AN B. Then by exactness, 
[n]A = (Yo (is + iz))(A) = 9 0 (Ga + iz )(A)) = PKOS) = {0}. 


so is a multiple of the exponentn, of AN B. 
To show the opposite divisibility, consider the commutative diagram 


Xt>(x,—x) 
0 ———+ ANB —+ AxB > J > 0 
fial (Fal) ov 
XH>(x,—Xx) x 
0 ———+ ANB —~+ AxB > J > 0, 


where the middle vertical map is (a,b) t+ (fi4a,0) and the map w exists because 
[74](A N B) = 0. But wy = ny, in End(J) ® Q. This shows that #47, € End(J/), 
ie., that 7, is a multiple of the denominator n of x4. oO 


Lemma 4.2. The element e, € (T4 ® Tg)/T has order F 4. 


Proof. We want to show that the order r of e; equals the exponent of M = (Ty ® 
Tz)/T. Since e; is an element of M, the exponent of M is divisible by r. To obtain 
the reverse divisibility, consider any element x of M. Let (a,b) € Ty ® Tz be 
such that its image in M is x. By definition of e; and r, we have (7,0) € T, and 
since 1 = (1,1) € T, we also have (0,r) € T. Thus (Tr, 0) and (0, Tr) are both 
subsets of T (i.e., are in the image of T under the map T > Ty @T3), sor(a,b) = 
(ra,rb) = (ra,0) + (0,rb) € T. This implies that the order of x divides r. Since 
this is true for every x € M, we conclude that the exponent of M divides r. Oo 
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Proof of Theorem 3.6(a). Since e is the image of e; under the rightmost vertical 
homomorphism in (3), the order of e2 divides that of e;. Now apply Lemmas 4.1 
and 4.2. Oo 


5 Proof of Theorem 3.6(b) 


Let T’ be the saturation of T = Z[..., 7,,...] in End(J/), ice., 
T’ = End(J) N(T @ Q). 


The quotient T’/T is a finitely generated abelian group because both T and End(J/) 
are finitely generated over Z. Since T’/T is also a torsion group, it is finite. 

In Section 5.1, we introduce two ideals R and S of the Hecke algebra that are 
generalizations of the notions of the congruence exponent and the modular exponent 
respectively. We will see that R C S and show that there is a natural injection 
S/R <— T’/T. In Section 5.2, we will prove that T and T’ agree locally at a maximal 
ideal of T under the condition that we call “multiplicity one for differentials”; 
we also give examples where this condition does not hold. Theorem 3.6(b) itself 
is proved at the end of Section 5.1, by applying the results of Section 5.1 and a 
proposition that is proved in Section 5.2 to show that R = S locally at a prime p 
such that p {N (when A is the dual of newform quotient, the condition that p N 
can be replaced by p? JN). 


5.1 The congruence and intersection ideals 


In this section, we work in slightly more generality, and take A and B to be as in 
the first paragraph of Section 3 (so T can be T')(V), and A need not be the dual of 
a newform quotient). Let 24: T — Ty and zg : T > Tz be the natural projection 
maps. 


Definition 5.1. With the setup as above, we define the congruence ideal as R = 
ma(ker(zp)) C Ty, and the intersection ideal as S = Anny,(A/ B). 
Lemma 5.2. We have RCS. 


Proof. By definition, R consists of restrictions to A of Hecke operators that vanish 
on B, while S consists of restrictions to A of Hecke operators that vanish on AN B. 
The lemma follows since the image in Ty of an operator that vanishes on B also 
vanishes on AN B. Oo 


Remark 5.3. By Lemma 5.2, we have a surjection T4/R — T4/S. Note that 24 
induces an isomorphism 


T ~ Ty 
od 
ker(4) + ker(zg) R 


’ 
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and we have an isomorphism 


T =~ Ta @Ts 
ns 
ker(r4) + ker(zg) T 


obtained by sending ¢ € T to (z4(t),0) € Ta @ Tg. Hence by Definition 3.4, 
the exponent of T,/R is 7, and its order is r4. Also, 74 is the exponent of AM B, 
and one expects that it is also the exponent of T4/S (certainly multiplication by 74 
annihilates T 4/5), which would give another proof that 7, | 74. Instead of pursuing 
this question, we record the following result, which will be needed later. 


Proposition 5.4. If p is a prime such that the localizations of R and S at p 
coincide, then ord, (F4) < ord, (4). 


Proof. Under the hypothesis, the surjection T4/R — T4/S is an isomorphism 
locally at p. The lemma follows from the observations above that 7, is the exponent 
of T4/R and that 7,4 annihilates T,/S. oO 


Lemma 5.5. There is a natural inclusion S/R — T/T of T-modules. 


Proof. We have 
TQ = (T1 BQ) G (Tz © Q) C (End(A) @ Q) 6 (End(B) ® Q) = End(J) @Q, 


which we use to view T and Ty as sitting inside End(J) @ Q. Also, the groups 
End(J) and T’ sit naturally in End(J) @ Q. By definition, R = T4 MT. Since an 
endomorphism of A x B factors through A x B — J if and only if it kills AN B 
embedded in A x B via x +> (x,—x), we have that S = Ty M End(J/) and this 
equals T4  T’ (since a suitable multiple of any element of T, lands in T, when 
both are viewed as subgroups of T® Q C End(J) ® Q). Hence we have R = SOT 
with intersection taken inside T’ C End(J) ® Q. Thus 


S/R=S/(SOT) &(S+1D/TOT/T. o 


If m is a maximal ideal of T, then we say that two Hecke modules, with one 
contained in the other, agree locally at m if their localizations at m are the same. 
Let J4 denote the kernel of the map T — Ty. As an immediate consequence of 
Lemma 5.5, we have: 


Proposition 5.6. Jf m is a maximal ideal of T containing I4 that is not in 
Supp,(T’/T), then the corresponding maximal ideal m/I4 of Ta is not in the 
support of S/R, i.e.: if T and T’ agree locally at m, then R and S also agree locally 
at m/ Ty. 


Remark 5.7. The ring 


T= End(J) 9 (Ty x Tg) = Tn (T,4 x Tz) 
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is often of interest, where the intersection is taken in End(J) ® Q. We proved above 
that there is a natural inclusion §/R <> T’/T. This inclusion yields an isomorphism 


S/R — T’/T, as is clear from the “if and only if” statement in the proof of 
Lemma 5.5. The ideals R and S are equal if the rings T and T” coincide. Even 
when T’ is bigger than T, its subring T” may be not far from T. 


The following lemma and proposition will not be used in the proof of Theorem 
3.6(b), but they are of interest from the point of view of multiplicity one. 


Lemma 5.8. Let p be a prime and let m be a maximal ideal of T with residue 
characteristic p. Suppose m satisfies the multiplicity one condition (i.e., J [m] is of 
dimension two over T/m). Then the completions of T and T' at m are isomorphic. 


Proof. As in [Maz77, p.92], consider the Tate module Tan(J), which is the 
Pontryagin dual of the m-divisible group associated to J(Q). Since J[m] is of 
dimension two over T/m, it follows that Ta,,(J/) is free of rank 2 over Ty, where 
the subscript denotes completion (see, e.g., [Til97, p. 332-333]). If r is an element 
of Ti then r operates T,,-linearly on Tam(J), and thus may be viewed as a 2 x 2 
matrix with entries in T,,. Further, some non-zero integer multiple of r operates 
on Tam(J) as an element of Ti, i-e., as a scalar. Thus r must be a scalar to start 


with, i.e., actually lies in T,,. Hence T.. = T,, as claimed. oO 


Proposition 5.9. Let p be a prime such that all maximal ideals m of T with residue 
characteristic p that contain I, satisfy multiplicity one. Then ord) (74) = ord) (f14). 


Proof. This follows from Lemma 5.8, Lemma 5.5, Proposition 5.4, and Theorem 
3.6(a). oO 


Proposition 5.10. Let T = To(N). Let p be a prime such that p? + N, and let 
m be a maximal ideal of T with residue characteristic p. If p|N, then assume that 
I¢ © mfor some newform f. Then T and T' agree locally at m. 


Since the proof of this proposition is rather technical, we have postponed 
it to Section 5.2. Admitting this proposition, we may now finish the proof of 
Theorem 3.6(b). 


Proof of Theorem 3.6(b). Recall that A and B are abelian subvarieties of J = 
Jo(N) such that A+ B = J, AN B is finite, and every endomorphism of J over Q 
preserves A and B. 

We first want to show that if a prime p does not divide N, then ord,(74) = 
ord, (74). In view of Theorem 3.6(a) and Proposition 5.4, it suffices to check that R 
and S coincide locally at p. By Proposition 5.6, it suffices to check that T and T’ are 
locally equal at all maximal ideals that divide p. If p + N, then this follows from 
Proposition 5.10, which proves part of Theorem 3.6(b). 

It remains to show that if f € S2(To(N),C) is a newform and p || N, then 
ord,(74,) = ord,(n4,). Note that the Hecke algebra T acts on S/R through its 
quotient T Ay = T/ Anny A¥ since the action of T on R and on S factors through 


this quotient. Thus, in view of Theorem 3.6(a) and Proposition 5.4, it suffices to 
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check that R and S coincide locally at maximal ideals of T that divide p and contain 
Anny A¥. = I; (the equality follows since J is saturated). But this follows from 
Proposition 5.6 and Proposition 5.10. oO 


5.2 Multiplicity one for differentials 


This section is devoted to the proof of Proposition 5.10 as well as a discussion of 
the notion of multiplicity one for differentials (Definition 5.13). In this section, we 
take TF = Ip(N). 

Let p be a prime such that p? + N. Let Mo(N) denote the compactified coarse 
moduli scheme associated to (NV) (as in [DR73, § IV.3]) over Zp, and let Xo(N )z, 
denote its minimal regular resolution obtained by suitable blow-up of the points 
Jj = 0, 1728 in characteristic dividing N, when they are supersingular (cf. [Maz77, 
p.63]). Let Qy,() uP denote the relative dualizing sheaf of Xo(N )Z» over Zp (it 
is the sheaf of regular differentials as in [MR91, §7]). We denote by Xo(N )r,, the 
special fiber of Xo(N)z,, at the prime p and by Qy,(yy/r, the relative dualizing 
sheaf of Xo(N )r, over F). 

The usual Hecke operators and the Atkin—Lehner involutions (corresponding 
to primes dividing N) of Jo(N) over Q extend uniquely to act on the base 
change to Z, of the Néron model of Jo(NV), which we denote by J2,- The natural 
morphism Pick, N)/z, — Jz, identifies Pigg ny/z, With the identity component 
of Jz, (see, e.g., [BLR90, §9.4-9.5]). Passing to tangent spaces along the identity 
section over Z,, we obtain an isomorphism H'(X0(N)z,, Oxon)z,) = Tan(Jz,,). 
Using Grothendieck duality, one gets an isomorphism Cot(Jz,)=H °(Xo(N igs 
QQ yo(N) /Lp)> where Cot(Jz,) is the cotangent space at the identity section (cf. 
[Maz78, p. 140]). Now the Hecke operators and the Atkin—Lehner involutions act on 
Cot(Jz,), and hence via the last isomorphism above, we get an action of the 
Hecke operators and the Atkin—Lehner involutions on H°(Xo(N Was Q yo(N) [Ends 
Following the proof of Prop. 3.3 on p. 68 of [Maz77], specialization induces an 
isomorphism 


H(Xo(N)g,, Qxo(ny/F,) & H’(Xo(N)z,, 2x wy/z,) Bz, Fp- 
In this way, we get an action of the Hecke operators and the Atkin—Lehner 


involutions on (X(N )r,, Qxo(N)/F,) as well. 
The following lemma is implicit in [Maz77, p. 95]. 


Lemma 5.11 (Mazur). Let m be a maximal ideal of T of residue characteristic p 
(recall that p? + N ). Suppose 


dimjm H°(Xo(N)g,, Qxowwy/e, EM] < 1. 


Then T and T’ agree locally at m. 
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Proof. Let M denote the group H'(Xo0(N)z,. Oyx,.(n)), Where Oxy) is the 
structure sheaf of Xo(NV). As explained in [Maz77, p. 95], we have an action of 
EndgJo(N) on M, and the action of T on M via the inclusion T C EndgJo(V) 
is faithful, so likewise for the action by T’. Hence we have an injection ¢ : T’ > 
EndyM. Suppose m is a maximal ideal of T that satisfies the hypotheses of the 
lemma. To prove that T,, = T/,, it suffices to prove the following claim: oO 


Claim: The map @|r is surjective locally at m. 


Proof. It suffices to show that M is generated by a single element over T 
locally at m, and in turn, by Nakayama’s lemma, it suffices to check that the 
dimension of the T/m-vector space M/mM is at most one. Now M/mM is 
dual to H°(X0(N)r,,Qx.y/r,)[m]. Since we are assuming that dimy/,H° 
(Xo(N)r,. Qyo(vy/F,){m] < 1, we have dimr;m(M/mM) < 1, which proves the 
claim. 


Remark 5.12. Note that Lemma 5.8 may provide an alternate route to the conclu- 
sion of the previous lemma (sometimes one can prove multiplicity one for a maximal 
ideal without relying on multiplicity one for differentials, e.g., see [Dia97]). Observe 
that in the proofs of Lemmas 5.11 and 5.8, all we needed was (locally) a non-zero 
free T-module (of finite rank, say) that is attached functorially to J. In Lemma 5.11, 
the module we used was H! (Xo(N)z,; Oy,(n)); locally, it is free because its 
reduction modulo m is of the same dimension as its generic rank (namely 1). In 
Lemma 5.8, we used the m-adic Tate module, whose reduction mod m is of the 
same dimension as its generic rank (namely 2). 


Definition 5.13. Jf m is a maximal ideal of the Hecke algebra T of residue 
characteristic p, we say that m satisfies multiplicity one for differentials if 


dimy/m(H°(Xo(N)r,,, 2x0 ny, )[m]) = 1. 


The above condition, which first appeared in [Maz77], plays an important role 
in several places, including Wiles’s proof of Fermat’s last theorem (see [Wil95, 
Lemma 2.2]). It has been used to prove multiplicity one for m (as in Section 2.2) 
and Gorensteinness of the completion of T at m (under certain hypotheses; see, 
e.g., [Til97]). 


5.2.1 Failure of multiplicity one for differentials 


In this section, we digress to discuss examples of failure of multiplicity one for 
differentials. The reader interested in the proof of Proposition 5.10 may jump to 
Section 5.2.2 below. 

By Lemma 5.11, if p? + N and if the multiplicity one condition for differentials 
holds at m, then T and T’ agree locally at m. It is thus of interest to compute the 
quotient group T’/T for various N. We compute this index in Sage [St 09]. and 
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Table 2 Nonzero Quotients T’/T for N < 325 


44 
46 
54 
56 
60 
62 
64 
68 
72 
16 
78 
80 
84 
88 
92 
94 
96 
99 
104 
108 
110 
112 
116 
118 
120 
124 
125 
126 
128 
132 
135 
136 
140 
142 
144 
147 
148 
150 
152 
153 
156 
158 


Cy 
CQ 
C3 
Cr 
Cy 
Cy 
Cy 
CQ 
Cr 
C2 


160 
162 
164 
166 
168 
169 
ligt 
172 
174 
175 
176 
180 
184 
186 
188 
189 
190 
192 
196 
198 
200 
204 
206 
207 
208 
210 
212 
214 
216 
220 
224 
225 
228 
230 
232 
234 
236 
238 
240 
242 
243 
244 


C} ® Cy ® Cg 


Cs 

CZ ®C] OCs 
C2 ® C2 

C? ® Cr @ Cg 


Cia 


Cr 

C3 ® C2 @Cy 
COC, 

C3 OBC? OC 


C] ® CZ ®Cy 


245 
248 
250 
252 
254 
256 
260 
261 
262 
264 
268 
270 
272 
275 
276 
278 
279 
280 
282 
284 
286 
288 
289 
290 
292 
294 
296 
297 
300 
302 
304 
306 
308 
310 
312 
315 
316 
318 
320 
322 
324 
325 
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C} 
Cl ® Cy ® Cg 

C$ 

CF 8 C2 @Cnr 

Cy 

C3 BC? OC? © Cis 


C] ® CP ® Cin @ Cu 
Ce, 


Ck BCR OCZ @ Cw 
Cy 

ClO C2 OCs 

C3 


obtain Table 2, where the first column contains N for N < 325 and the second 
column contains the quotient group T’/T, where C, denotes a cyclic group of 


order n. 
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In each case in which a prime p divides [T’ : T] but p? + N, Lemma 5.11 
implies that there is some maximal ideal m of T of residue characteristic p for 
which multiplicity one for differentials does not hold. For example, when N = 46, 
we find that [T’ : T] = 2, and 2? } N; thus there is a maximal ideal m of T of 
residue characteristic 2 for which multiplicity one for differentials does not hold. 

In Table 2, we observe that whenever p divides [T’ : T], then p = 2 or p? | N. 
This raises the question: is it true that if p is odd and p? } N, then multiplicity 
one for differentials holds for maximal ideals m of T of residue characteristic p? 
Lemma 5.20 below gives an affirmative answer in one direction (the other direction 
is usually easy), but under the hypothesis that if p | N then U, acts as a non-zero 
scalar on H°(Xo(N )r,,, 2x cwy/r,)m]. 


5.2.2 Proof of Proposition 5.10 
The main point is to prove that the hypothesis 
dimjm H(Xo(N )p,. Qxowwy/F, im] < 1 


of Lemma 5.11 holds for suitable maximal ideals m. This is achieved in Lemma 5.20 
below, whose proof requires an Eichler-Shimura type relation for U, (Lemma 5.15 
below). We obtain this relation by modifying the argument in [ Wil80, §5], which is 
in the T'|(V) context, to the o(V) situation. Let L denote the maximal unramified 
extension of Q, and let O; denote the ring of integers of L. For the sake of 
completeness, we state below a lemma that is well known (e.g., it is used implicitly 
in [Wil80, p. 18]); the proof was indicated to us by F. Calegari. 


Lemma 5.14. Let E be an elliptic curve over O, with good ordinary reduction. 
Then the subgroup schemes of E of order p are p copies of Z/pZ and one copy 


Of Wp. 


Proof. Let G = E[p], and consider its connected-étale sequence 
0- G°-G—>G"=0. 


Now G° is in the kernel of the reduction map, and we know that the reduction 
of E[p] has non-trivial order. Hence G* is non-trivial. By Cartier duality, G° is 
also non-trivial. Hence G*' is a Z/ pZ and by duality, G° is a p p- Thus one of the 
subgroup schemes of E of order p is a copy of w,. Let H be any other subgroup 
scheme of EF of order p. Then 1 9 has to be trivial, since otherwise H = H°® is 
a non-trivial subgroup scheme of G? = w p> hence is equal to G =p, p> Which 
has already been accounted for. Thus H is étale, and hence is a copy of Z/pZ. 
The lemma follows, since there are p + 1 subgroup schemes of order p in E[p], 
hence in FE. Oo 


We assume that p||N until just after the proof of Lemma 5.18. Let M = N/p. 
We will use the superscript to denote the subscheme of Mo(N) obtained by 
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removing the supersingular points in characteristic p. Following [DR73, VI.6.9] 
and [DR73, § V.2], the F,-valued points of Mo(N )" are in one-to-one correspon- 
dence with isomorphism classes of triples consisting of 


(a) a generalized elliptic curve E over F,, whose smooth locus we denote E*™, 
(b) a subgroup of E*™[p] isomorphic to p., or to Z/ pZ, and 
(c) asubgroup Z/MZ of E*™[M], 


such that the subgroup generated by the subgroups in (b) and (c) above meets 
every irreducible component of every geometric fiber of E over F p- Also, Mo(N \F, 
has two irreducible components, which may be described according as whether the 
subgroup in (b) is isomorphic to pw, or to Z/ pZ. As mentioned earlier, Xo(N \F, is 
obtained from Mo(N Fr, by suitable blowups and consists of two copies of Xo(M )F, 
identified at supersingular points, along with some copies of P! (see the description 
of Xo(N zg, on p. 175-177 of [Maz77] for details). One of the copies of Xo(M x, 
corresponds to the irreducible component of Mo(N ME, where the subgroup in (b) 
is isomorphic to Z/pZ; we denote this copy by Co. The other copy of Xo(M)y, 
corresponds to the irreducible component of Mo(N )F, where the subgroup in (b) is 
isomorphic to pt ,, and contains the cusp oo; we denote this copy by C. We denote 
the copies (if any) of P! by Cz,...,C,, where r is one less than the total number of 
irreducible components of Xo(V )g.. 

The usual endomorphisms U, and W, of Jo(N) over Q can be extended by base 
change to L, and extend uniquely to act on the Néron model of Jo(N) over Ox. 
Since the formation of Néron models is compatible with completions and unramified 
base change, this action is compatible with the already-defined action on the Néron 
model of Jo(V) over Z,. The identity component of the special fiber of the Néron 


model of Jo(N) over Ox is Pic, (N)/F,? whose maximal abelian variety quotient 
Pp 


is [Es Pice, (cf. [DR73, 1.3.7] and [BLR90, §9.2, Example 8]). Thus we get an 
i/Mp 

0 

Xo(N)/Fp 

Frobenius morphism on Cy /F,* 


action of U, and W, on Pic and on | ];~ Pic?. jr,’ Let Frob, denote the 
i/Ep 


Lemma 5.15. The endomorphisms U, and W, of [];<o Pic? _ satisfy Up, = 


Ci/Fp 
Froby + (p — 1)Wp on Pice, jp. - 
Proof. The proof is a modification of the proof of Theorem 5.3 in [Wil80], along 
with some details borrowed from the proof of Theorem 5.16 in B. Conrad’s appendix 
to [RSO1]. 

It suffices to check the desired identity on a Zariski dense subset of 
Pic cy/F, (F,) = J(Co)(F p), where J(Co) is the Jacobian of Co. If g is the genus 
of Co, then fixing a base point, we get a surjection Cy —> J(Co). Hence if U is 
any dense open subset of Co (F,), then U§ hits a Zariski dense subset of J(Co)(F p). 
Taking U to be the ordinary locus of Co(F p), it thus suffices to prove the desired 
identity on divisors of the form (Q) — (Q’), where the elliptic curves corresponding 
to Q, QO’ € Co(F,) are ordinary. 
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Let Mo(N) denote the algebraic stack over O, associated to [9(V) by [DR73, 
IV.3.3, IV.4.2], whose associated coarse moduli scheme is Mo(NV) (over O,). Let 
a : Mo(N) — Mo(N) denote the associated natural map. If k = F,, or an algebraic 
closure of L, then z is an isomorphism on k-valued points, and so we will often 
identify points on Mo(N)(k) with points on Mo(N)(k). Let Q be an ordinary 
point on Co(F,). Then Q is given by a triple (E,C, D), where E is an ordinary 
elliptic curve over F p> C is a subgroup isomorphic to Z/pZ, and D is a subgroup 
isomorphic to Z/MZ. We can choose a Weierstrass model E — Pe _ lifting Ez 
then F is canonically an elliptic curve by [KM85, Chap. 2]. By Pena 5, 14 and its 
proof, there is a subgroup C of E isomorphic to Z/pZ that lifts C. Also, as argued 
in [RSO1, p. 219], there is a subgroup D of E isomorphic to Z/MZ that lifts D. 
Then (E,C, D) gives a point on Mo(N)(O_) (cf. [DR73, V.1.6]), whose image 
in Mo(N)(Q_) corresponds to a point P in Xo(N)(O_) (since EF has ordinary 
reduction). We will use a bar to denote specialization. Thus we have Q = P. 
Similarly, given another point Q’ € Co(F,), we will denote the corresponding 
associated quantities by a prime superscript (thus P’ in Xo(N)(O_) denotes a 
lift of Q’, etc.). As mentioned in the previous paragraph, it suffices to prove the 
relation claimed in the lemma for elements of the form (Q) — (Q’) in Pics, iF, (F pe 


Viewing P and P’ as relative effective Cartier divisors of degree one, we see 
that U,((Q) — (Q’)) is the image of U,((P) — (P’)) under specialization, i.e., 


Up((Q) — (Q")) = Up((P) — (P’)). 

We next compute U,((P) — (P’)). Now Pic? Xo(N)/O, 18 the identity component 
of Jo(N )o,, and we have Jo(N Jo, (Ox) = Jo(N)(L) & Jo(N)(L), where L is an 
algebraic closure of L. Denoting base change to L by a subscript L, we have 


Up ((Ez. Cr, Dr) — (Ez, Cy, D5) 


= )0(Ep/Az. (Cp + Az)/Az, (Dz + Az)/Az) 
AT 

— \(EL/AD (Ch + AD/AL (DE + AD/AD, =O 
Af, 


where Az runs through the subgroups of Ez of order p except Cz (and similarly 
for A‘). Enlarging L by a finite extension if needed (which does not change the 
residue field F,) we may assume that there are p + 1 subgroups of order p in E_. 
Their scheme-theoretic closures in E over O, are the subgroups schemes mentioned 
in Lemma 5.14. If A is a subgroup scheme of E of order p, then we denote 


the quotient map E — E/A by ay. Consider the Cartier divisors corresponding 
to U,((P) — (P’)) and to 


(2(E/pp.%y,,(C).eu,(D)) + ) > x(E/B, clan (C)),a8(D))) 
B 


—(2(E'/w,. oy, (C1), ety, (DD) +) 2 (E'/B', cle (C’)), a8(D'))), 


B’ 
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where B runs through the subgroups of E isomorphic to Z/pZ except for C, and 
cl(ag(C)) denotes the Zariski closure of ag (C) in E/B (and similarly with prime 
superscripts). These two divisors coincide since they induce the same L-point by (4). 

Passing to special fibers, and noting that the special fiber of the Néron model 
of E/A is given by E/A, we find that 


Up((Q) — (Q")) = U,((P) — (P) 
= (E/W;.%,©.%,(D)) + )E/B.Aes©).@2(D))) (5) 
B 


—(E'/w7, Gy, (C1), Zu, (D)) + EB’ Man (CY). za(D))), (6) 
B’ 


where B again runs through the subgroups of E isomorphic to Z/ pZ except for C 
(and a similar statment holds with prime superscripts). 


Let F, denote the relative Frobenius map E > E ” over F,,. Now wp, is in 
the kernel of F,, and since the quotient map @, has the same degree as Fp, 


there is an isomorphism ¢ : E/p, 5 FE” such that Fi, = po Oy, Also @ 
induces an isomorphism @,, , (C) = Gc and @, , (D) 5S D”. Thus the first 
term in (5) is identified with Ee Ce Dp which is the image under Frob, 
of P = (E,C, D). Similarly, the first term in (6) is Frob p (P’). 

As for the sum over B in (5), note that in each term, we are quotienting by a 


group B which is isomorphic to Z/pZ, and hence cl(ag(C)) is of w,-type. In a 
manner similar to the computation of the action of U,, we find that 


W,((E, C, D) — (E’,C’,D’) 
= (E/C, E[p]/C,(D + C)/C) (7) 
=((A7C",B [pljc', (2 +0 7/C 7, (8) 


Considering that P = (E,C, D), with C isomorphic to Z/ pZ, we see that E[p]/C 
is isomorphic to pt ,. Also, if B is as in the sum in (5), then B isa Z/pZ, but there is 
only one copy of Z/pZ in E, since E has good ordinary reduction; hence B = C. 
Thus each of the terms in the sum over B in (5) is the term in (7). A similar statment 
holds with prime superscripts (viz., each of the terms in the sum over B’ in (6) is 
the term in (8)). 

The lemma now follows from the previous two paragraphs. Oo 


Since we are assuming that p||N, the curve Xo(N yr, has ordinary double 


point singularities, and so the differentials in H°(Xo(N Je, 2xqcy/F,) May be 


possible poles are at points on [];_, C; lying over an intersection point of two 
components in X(N \F, and where the sum of the residues at the points lying 
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over an intersection point is zero; such differentials are called regular differentials 
(see [Con00, §5.2] for the justification that the relative dualizing sheaf under 
Grothendieck duality is indeed the sheaf of regular differentials). By a holomorphic 
differential in H°(Xo(N Je, 2x cvy/F,)» We mean a regular differential all of whose 
corresponding @; have no poles at all (i.e., for all i, @; € H°(C, Qc, /F,) ). The 
subspace of holomorphic differentials may be identified with []}_) H°(C;, Qe, /F,) 
(which we will often do implicitly), and we let i; denote the corresponding injection 
Tao H°(C:, 26, 8,) > H(XoN)p, » 2 xo avy) 

In a manner similar to the description in the third paragraph of Section 5.2, 
Grothendieck duality gives an isomorphism 


@: H°(Xo(N)o,, Qxowy/o1) > Cot(Picy,.yy/0,)> (9) 


where Cot denotes the cotangent space at the identity section. Since we have an 
action of U, and W, on Picy.( y/o, (by viewing it as the identity component of the 
Néron model of Jo(N) over O,), we may use © to get an action of these operators 
on H°(Xo(N)o,, Qxo(N)/O,)- AS before, Prop. 3.3 on p. 68 of [Maz77] implies 
that base change to F, gives an isomorphism 


H°(Xo(N) ge,» 2yoivy/F,) & H’(Xo(N)o,, 2xowy/o1) Bo, Fp. (10) 


From this, we get an action of U, and W, on (X(N Jz, Qyovy/F,)- 


Corollary 5.16. The endomorphisms U, and W, of H°(Xo(N )g,,, Qyo(wy/F,) pre- 
serve the subspace |];<) H°(C, Qo fF, ), and satisfy Up = +Frob;, + 
(p — 1)W, on H°(Co, Qo /F,)» where Frob;, denotes pullback by Frob, and where 
we have a possible sign ambiguity + (which will not affect us later). 


Proof. The proof is based on the following diagram; we describe below some of the 
maps in it that have not been defined yet. 


0) 


H(Xo(N)o,, 2x.()/0,) ————> Cot(Picy, «yy /0,) 


IU m2 
x 6 
0 _ _ 0 
H (Xo(N gz,» 2x.cy/F,) > Cot(Picy yy /e,) 
\ 


TTi=o 4°(Ci. 2¢,/R,) 


TTi=o Cot(Pic’, jp, )- 


40 A. Agashe, K.A. Ribet, and W.A. Stein 


Firstly, Cot always denotes the cotangent space at the identity section. The 
map sz, is obtained by base change to F,. By (10), 2; is surjective. The map zr is 
obtained by observing that Pic, (N)/F, is the identity component of the special fiber 
of the Néron model of Jo(N) over O,, and hence maps to the identity component of 
the Néron model of Jo(N) over O;, which is Pic? Xo(N)/O,- The map @ is obtained 
using Grothendieck duality. The compatibility of Grothendieck duality under base 
change (see [Con00]) implies that the top square in the diagram above commutes. 

Now we have already defined actions of U, and W, on Picy,( nyo, and 


on PIC ca iF Gust before Lemma 5.15). Thus we get actions of U, and W, 
P 
0 


on Cot(Pic, N)/o,) and on Cot(Pic X(N)/ 
we see that zz is compatible with the actions on its domain and codomain. Recall 
that we used the isomorphism @ to induce actions of U, and W, on H°(Xo0(N)o,, 
Qx,(v)/O,) and then used formula (10) to get actions on H(Xo(N )g,,, Q y(y/F,): 
Thus © and z are also compatible with the actions of U, and W, on their domain 
and codomain. Let w € H(X0(N)g,. Qywy/F,)> and let 2 € H°(X0(N)o,, 
Qyx iN)o,) be such that m(2) = w. Then O(U,(@)) = O(m(Up(Q))) = 
m(OUp(Q))) = m(U,(O(Q))) = Up(r(O(Q))) = Up(0(r2(2))) = 
U,(6(@)). Thus we see that the isomorphism 6 is compatible with the action of U, 
(and similarly for W,,) on its domain and codomain. 

Now we turn to the bottom square in the diagram above. As mentioned earlier, the 


;,)- From the definitions of these actions 
P 


injection iz arises because [];_ Pics, iF is the maximal abelian variety quotient of 
i/Mp 
the identity component Pics, (N)/F of the special fiber of the Néron model of Jo(V) 
P 


over O,. The map 6’ is the isomorphism coming from Serre duality. 

Next, by [Con00, §5.2], the Grothendieck duality isomorphism 6 is the same 
as the isomorphism coming from the duality theory of Rosenlicht (as in [Ser88, 
Chap. IV]), perhaps up to multiplication by —1. Assume for the moment that 
there is no sign ambiguity, so that @ is indeed the isomorphism coming from the 
duality theory of Rosenlicht. One can check that the Serre duality isomorphism 6’ 
is induced by the Rosenlicht duality isomorphism 6 via the inclusions i; and iz by 
looking at the proof of the two dualities in [Ser88, Chaps. II and IV]. Note that 
in [Ser88], the curve X over the field k (notation as in loc. cit.) is assumed to be 
irreducible. This hypothesis is needed in loc. cit. (for our purposes) only to show 
that H'(X,k(X)) = 0 (p. 12, loc. cit.); the latter condition holds so long as X is 
reduced (see top of p. 165 in [AK70], as well as the bottom of p. 138 and top of 
p. 132 therein), which is true in our case (taking X = Xo(N gr, and k = F,) : 
We remark that our contention that the Serre duality isomorphism 6’ is induced by 
the Rosenlicht duality isomorphism @ via the inclusions i; and 72 also follows from 
Section 6 (an appendix provided to us by Brian Conrad), by taking C = Xo(N \F, 
and C’ to be any of the C; in Section 6. In any case, we conclude that the bottom 
square in the diagram above commutes as well, perhaps up to multiplication by —1. 

Now the action of U, and W, on [];_ Pict, jp, Was defined by identifying 


it as the maximal abelian variety quotient of Pic® 


Xo(N)/Fp" Thus we see that i> 1s 
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compatible with the action of U, and W, on its domain and codomain. Considering 
that moreover the isomorphism @ is compatible with the action of U, (and W,) and 
the bottom square in the diagram above commutes, perhaps up to multiplication 
by —1, we see that U, and W, preserve [];<9 H°(C;, 2¢,/%,). Now since 4 is 
compatible with the action of U, and W, on its domain and codomain, so is 6’. 
Thus we may use the isomorphism 06’ to translate the identity in Lemma 5.15 
from the right to the left of 6’ to get the desired identity in the corollary, where 
the + ambiguity in front of Frob,* is really due to the sign ambiguity about the 
compatibility of the action of U, and W, on the two sides of the isomorphism 6’. 0 


Remark 5.17. We defined the action of the Hecke operators and the Atkin—Lehner 
involution in characteristic p from their definition in characteristic 0 in a somewhat 
indirect manner via the Néron mapping property, Grothendieck duality, etc (cf. 
beginning of Section 5.2). This has made our proofs rather complicated, since we 
have to show several compatibilities (as in the previous Corollary 5.16 and the 
upcoming Lemma 5.18). After this article was written, B. Conrad pointed out to us 
that one can define the action of the Hecke operators on suitable Artin stacks over Z 
for o(V )-structures (see [Con07]) in such a way that the definition agrees with the 
usual definition of the Hecke operators over Q. This naturallly defines the action of 
the Hecke operators on objects related to Xo(N) such as differentials, Picard groups, 
etc., in characteristic p and these definitions are automatically “compatible” with the 
corresponding definitions in characteristic zero. This alternative method would have 
been a less complicated way to proceed. 


By [Maz77, Prop. II.3.3] we have an isomorphism 
H°(X0(N)g,,, Qyowy/F,) = H°(Xo(N)r,, 2xowy/F,) @r, Fp, 


using which we may identify H°(Xo(N)r,, Q yo()/Fp) as a subspace of 
H°(Xo(N )g,. Qy,(yy/F,)- Just before Corollary 5.16, we defined an action of Up 
(and W,) on H°(X0(N)g,. 2x,~y/e,): 

Lemma 5.18. The action of U, (respectively W,) on H°(Xo(N)p, Qywwy/F,) 
preserves the subspace H°(X0(N)r,. Q x0(N)/F, ), and agrees with the action of Up 


(respectively W,) on this subspace that we defined earlier in the third paragraph of 
Section 5.2. 


Proof. We have the following diagram, obtained by the obvious base changes: 


(3) 
H°(Xo(N)g,, 2x, cnye,) < H°(XoN )o,, 2x «wyo,) > CotPicy yo.) 
h ° 7 i 
H°(Xo(N)r,, 2xony/ep) — H°(Xo(N)z,,2xiwy/z,) > CotlPic¥~y/z,)> 
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where the map ©’ is the isomorphism coming from Grothendieck duality as 
discussed in the third paragraph of Section 5.2. Now the action of U, and W, 
on Cot(Picy, wwy/o,) = Cot(Jo(N)o,) (where Jo(N)o, is the Néron model 
of Jo(N) over O;) was obtained by base changing from Z,. Considering that 
the formation of Néron models is compatible with completions and unramified 
base change, we see that the rightmost vertical map above is compatible under the 
action of U, and W,. Also, the action of U, and W, on H°(Xo(N)o,, 2 x(wy/O1) 
(respectively on H°(Xo(N Jp,» 2 xo(vy/F,)) Was obtained via the isomorphism © 
(respectively ©’). Thus the rightmost two horizonal maps above are also compatible 
under the action of U, and W, on their domain and codomain. Finally, the 
compatibility of Grothendieck duality under base change (see [Con00]) implies that 
the right square in the diagram above commutes. Arguing as in the third paragraph 
of the proof of Corollary 5.16, one sees then that the middle vertical map above is 
compatible under the action of U, and W,. 

Now the already-defined action of U, and W, on H°(Xo(N )F,> Qxo(N) /F,) in the 
third paragraph of Section 5.2 is obtained via the lower leftward pointing arrow in 
the diagram above, and the action of U, and W, on H°(X0(N Ye,» 2x vy/F,) 18 ob- 
tained via the upper leftward pointing arrow in the diagram above. Thus the leftmost 
two horizontal arrows are compatible under the action of U, and W, on their domain 
and codomain. Repeated applications of [Maz77, Prop. II.3.3] show that the left 
square also commutes. Using all this, we see that the action of U, (respectively W,) 
on H°(Xo(N)r,,,2x,(wy/F,) Viewed as a subspace of H°(Xo(N)p, Qywwy/F,) 
agrees with the action of U, (respectively W,) on H°(X0(N )F,> QQ yo(N) /F,) defined 
in the third paragraph of Section 5.2, and in particular that U, and W, preserve this 
subspace. Oo 


We now revert to the assumption that p is a prime such that p? + N (in 
particular p may not necessarily divide NV). The Tate curve over F , [[q]] gives rise to 
a morphism from Spec F,,|[q]] to the smooth locus of Xo(V)r, — Spec Fp. Since 
the module of completed Kahler differentials for F,|[q]] over F, is free of rank 1 
on the basis dq, we obtain a map 


q-exp : H°(Xo(N)r,, 2xocwy/e,) > F pllall. 


If p + N, then by a holomorphic differential in (X(N )r,, Qyxo(N)/Fp)> we 
mean any differential in A°(Xo(N)r,, Q xo(N)/F,)- 


Lemma 5.19. Recall that p is a prime such that p? + N, and m is a maximal 
ideal of T with residue characteristic p. If p|N, then assume that Up acts as a 
non-zero scalar on H°(X0(N)g, Qy (yr, )[m]. Then the map q-exp restricted to 
homomorphic differentials in H°(Xo(N)r,, Qyo(vy/F,)[m] is injective. 


Proof. The essential argument is quite standard, going back to Mazur, so we 
only sketch the ideas. For some of the details, we refer the reader to the proof 
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of Lemma 4.2 in [ARSO6]. If p + JN, the injectivity follows from the q- 
expansion principle. So suppose that p || N, and let M = N/p. Recall that 
Xo(N MF, is obtained from Mo(N Fr, by suitable blowups at supersingular points 
and consists of two copies of Xo(M JE, identified at supersingular points, along with 
some copies of P!. Suppose w € H°(X0(N)r,, Q yo(ny/F,){m] is a holomorphic 
differential that is in the kernel of g-exp. Then the g-expansion principle implies 
that w vanishes on the copy of Xo(M Jz, containing the cusp ov, ie., on C}. 
By Corollary 5.16 and Lemma 5.18, we have U,(w|c,) = +Frob,*(@|c,) + 
(p—1)W,(@|c,). But pullback by Frob, is the trivial map and W, swaps Cp and Ci, 
so U,(@lc.) = (p — 1)(@|c,) = 0. Now by hypothesis, U, acts as multiplication 
by a non-zero scalar, hence @ is trivial on Co. Thus @ is trivial on both copies 
of Xo(M MF, One can show that then w is trivial on the copies of P!' as well 
(see the proof of Lemma 4.2 in [ARS06]). Thus @ is trivial on Xo(N ae hence 
on Xo(N )r,. oO 


Lemma 5.20. We continue our hypotheses that p is a prime such that p? + N, m 
is a maximal ideal of T with residue characteristic p, and if p|N, then U, acts as a 
non-zero scalar on H°(Xo(N)r,. Q y(n), )[m]. Then 


dimy/m H°(Xo(N)g,, Qxo ny, Im] < 1. 


Proof. The idea behind the proof is the same as in the proof of Lemma 2.2 in [Wil80, 
p. 485-487], which in turn builds on ideas from p. 94-95 of [Maz77]. However, 
parts of our arguments are somewhat different, and may be considered alternatives 
to some of the methods in the works cited in the previous sentence. 

Ifwe H°(X0(N)r,,. Q yo(N)/F,) andn > 1, then let a,(@) denote the coefficient 
of g” in g-exp(@). We have a pairing H°(Xo0(N)r,. Q yx (v)/F,) x T + F, that takes 
(w, T) to a,(T@). This induces a map 


W : H°(Xo(N)g,. 2x wy/F,) [mM] > Home, (T/m, F >), 
which is a homomorphism of T/m-vector spaces. 


Claim 1: If wm € ker(y), then g-exp(q) is trivial. 
Proof. Following the proof of Prop. 3.3 on p. 68 of [Maz77], we have 


H°(Xo(N)g,, 2xowy/k,) & H(Xo(N)z,,. 2xo(wy/Z,) zy Ep, (11) 
and 
H°(X0(N)(C), Qxwyyc) & H(Xo(N)z,, 2x wy/z,) Bz, C. (12) 


The definition of the action of the Hecke operators on H°(Xo(N Vis QQ yo(N) j%5) 
defined in the third paragraph of Section 5.2 shows that this action is compatible 
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with the action of the Hecke operators on H°(Xo(N)(C), Q x,cwy(@y/c) under (12). 
Also, the action of the Hecke operators on H°(Xo(N yr,» Qo) je) was defined 
in the third paragraph of Section 5.2 via their action on H°(X0(N)z,,. Q xo(N)/Zp) 
using (11), so these actions are clearly compatible under (11). Now 


H°(Xo(N)(C), 2x wwyo/c) & HVo(N)(C), Qjwwy@/c) S S2(To(N), ©), 


and thus a\(T,@) = d,(@) for w € H°(Xo(N)(C), Qy ~wy(c)/c). Hence, by (11), 
(12), and the discussion above, we also have the formula a|(7,@) = a,(q@) for 
w € H°(Xo(N)r,, 2x ny/F,)- 

Thus if w € ker(y), then a,(@) = a,(T,@) = 0 for alln > 1, i.e., g-exp(@) is 
trivial, as was to be shown. oO 


Claim 2: The T/m-dimension of the subspace of holomorphic differentials in 
H?(X0(N)r,. Qy,wy/r,)[m] is at most 1. 


Proof. If w is a holomorphic differential in H°(Xo(N )p,, 2xq.wy/r,)[m] and y(w) 
= 0, then by Claim 1, g-exp(@) is trivial, and hence by Lemma 5.19, is trivial. 
This proves that y is injective when restricted to the subspace of holomorphic 
differentials. Now the group Homy,(T/m,F,,) has the same size as T/m, which 
completes the argument because w embeds the subspace of holomorphic differen- 
tials in H°(X0(N)r,. Qyvy/F,){m] into Homr,, (T/m, F,), which has dimension 1 
as a T/m-vector space. Oo 


Claim 2 proves the lemma in the case when p + N. We now prove that 
dimy/m H°(X0(N)r,. Qyovy/e,){m] < 1 when p||N, which will finish the proof 
of the lemma. Following the proof of Lemma 2.2 in [Wil95], we break the argument 
into two cases: 


Case I: There is no non-zero holomorphic differential in 
H°(X0(N)g,, 2xowy/e,) fm]. 


Suppose @; and @ are two differentials in H°(Xo0(N)r,. Qy,cvy/F,){m]. Then we 
can find a pair (u,A) € (T/m)* with (u,A) 4 (0,0) such that py(a,) — 
AW(@2) = 0, Le, W(ua, — Aa) = 0. Hence by Claim 1, g-exp(ua1 — 
Aan) = 0. Viewing 4@, — Aw, as an element of H°(X0(N)g,. Qyo(vy/F,)> we 
see that ;4@; — Aw vanishes on C (recall that C; is the copy of Xo(N/D)g, that 
contains the cusp oo) by the “g-expansion principle” (see the proof of Lemma 4.2 
in [ARS06] for details). Now Cz,...,C, (the copies of P!) arise as chains that 
link C, and Cp (recall that Cp is the copy of Xo(N/ PF, that does not contain the 
cusp oo) and each of C,...,C, has at most two points of intersection, with all 
intersection points being ordinary double points (see the description of Xo(NV )F, on 
p. 175-177 of [Maz77] for details). Taking into consideration the definition of 
regular differentials and the residue theorem we see that j4@, — Aw is holomorphic 
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on the curves among C2,...,C, that intersect C; (for details, see the proof of 
Lemma 4.2 in [ARS06] in a similar situation). Now a curve among C>,...,C; 
that does not intersect C, intersects exactly one curve among C),...,C, that does 
intersect C;. Hence by repeating the argument above, 4w, — Aw is holomorphic 
on each curve in C2,...,C, that does not intersect C; as well. Thus 1a, — Aa@ is 
holomorphic on all of Xo(NV )F, except perhaps on Co. But the only possible poles of 
[La — Aw? on Cp are over points of intersection with other components, and again, 
considering the definition of regular differentials, we see that there are no such poles, 
ie., 4@, — A@ is holomorphic on Cp as well. Thus (4@; — Aw? is holomorphic 
everywhere and is an element of H°(Xo(N Pe Q2.X0(N)/F » tm]. Hence it is trivial by 
the hypothesis of this case. Thus w; and w2 are linearly dependent. Since @, and w2 
were arbitrary, this shows that dimy/m H°(X0(N)r,, Qy.(vy/F,)[m] < 1 in this case. 


Case IT: There is a non-zero holomorphic differential 
WME H°(X0(N)r,. Q yo vy/F,)[m]. 


By Lemma 5.19, qg-exp(@) is non-trivial, and so by Claim 1, w(w) 4 0. 
Let w’ € H°(Xo(N)r,, 2xocwy/e, Im]. Then there is a A € T/m such that 
W(o') —Aw(@) = 0, ie. w(@’ — Aw) = 0. As in the proof of Case I, we 
conclude that w’ — Aw is holomorphic; in particular w’ is holomorphic. Thus 
every differential in H°(Xo(N )r,» 2x0(y/R,)[M] is holomorphic. Then by Claim 2, 
dimy/m H°(X0(N)r,, Q yx (vy/F,)[m] < 1 in this case as well. oO 


(Proof of Proposition 5.10). Recall that the hypotheses of Proposition 5.10 are 
that p is a prime such that p* + N, m is a maximal ideal of T with residue 
characteristic p such that if p|N, then Jy C m for some newform f. We wish 
to show that then T and T’ agree locally at m. 

If p + N, then the result follows from Lemmas 5.11 and 5.20. If f is a 
newform and p|N, then U, acts as +1 on f, and hence U, + 1 € Jy. Thus if 
p|N and If C m for some newform /, then U, acts as a non-zero scalar (+1) 
on H°(Xo(N )r,> QQ y(n) je, ){m] (note that the action of U, on regular differentials 
was defined compatibly with the usual action of U, on complex differentials, i.e., on 
cuspforms; cf. the proof of Claim | in the proof of Lemma 5.20). The proposition 
follows again from Lemmas 5.11 and 5.20. oO 


6 Duality theory: an appendix by Brian Conrad 


Let k be a field and let C be a proper reduced k-scheme with pure dimension 1. 
Assume that C is generically smooth, and let C’ C C be a non-empty reduced 
closed subscheme with pure dimension | (so C’ is also generically smooth). The 
case of most interest to us is when C is a geometrically connected and semistable 
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curve and C’ is a smooth geometrically irreducible component. The inclusion C’ > 
C induces a natural map of k-groups Picc;; — Picc’/;, and on tangent spaces at 
the identity this is the canonical pullback map 


6: H'(C,Oc) > H'(C’, Oc’) 


(as we see by computing with dual numbers over k). Each of C and C’ satisfies 
Serre’s condition (S;) by reducedness, so each is Cohen—Macaulay. Thus, by Serre 
duality we can identify the map of cotangent spaces with the map H°(C’, wc, Jk) > 
H°(C, @c/~) dual to 6. We wish to give a concrete description of this latter map. To 
do this, we first review some basic definitions and identifications in duality theory. 

In what follows we use Grothendieck’s approach to duality theory, which has the 
merit of permitting more localization operations than in Serre’s approach. Since 
C and C’ are Cohen—Macaulay with pure dimension 1, their relative dualizing 
complexes over & are naturally identified with wc,;[1] and wc7/,[1] respectively 
{[Con00, 3.5.1]. Since (by construction) the formation of the relative dualizing 
complex is compatible with Zariski-localization on the source, we have canonical 
isomorphisms @cer/x|csm & Q orm 5 and wo/kK|cs & Qosm 5 that coincide on the 
open locus U = CS" C’ that is dense in C’ (and supported in C’""). If we let 
j :C8™ > C and j’ : C’/™" —> C’ denote the canonical dense open immersions 
then, by [Con00, 5.2.1] the natural maps 


WC'/k > Fa (Qorsm j,)s WC/k > Je (QEsm/p) 


are injective. By construction this is compatible with the natural isomorphism 
wc/klu X @c’/klu. Letting n : Spec(K) — C and n! : Spec(K’) > C’ denote 
the canonical maps from the schemes of generic points, wc’/; maps isomorphically 
onto a coherent subsheaf of 1, (Qe / ,) and likewise for wc/, in ne (Qh / ,J; these 
image subsheaves are the so-called sheaves of regular differentials, and a classical 
result of Rosenlicht describes these images explicitly using residues when k is 
algebraically closed [Con00, 5.2.3]. We will not require Rosenlicht’s result for the 
statement or proof of the theorem below. 

Using Grothendieck’s theory of relative trace maps, the canonical closed im- 
mersion 4 : C’ > C over k induces a trace morphism Tr, : tx(@c7/k) > @c/k 
whose formation commutes with Zariski-localization on C, so over the dense open 
U = 1«'(C™) C C’ it induces the natural isomorphism wc'klu & @cyklu; 
or equivalently it is the identity map on OF [ke Hence, Tr, is compatible with the 
canonical inclusions wc;,. > Me (Qn K) and wc/k > Nx (Qk): In particular, 
the map Tr, is compatible with the natural identification of meromorphic 1-forms on 
C’ with meromorphic 1-forms on C (i.e., compatible with the injection eee ig 
Ve jp: 

Having summarized some inputs from duality theory, we can now state the result 
we want to prove. 
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Theorem 6.1. The pullback H'(C,Oc) — H!(C', Oc:) is dual to the natural map 
H°(C', wrx) = H°(C, tx(@erjx)) > H(C, w/e). 


Proof. Let Trc : H'(C,@cj~) — k and Tre : H'(C’,a@cr/x~) > k be the 
canonical trace maps, so our problem is to prove that for s € H'(C,Qc) and 
& € H(C’, c'/k) = erie 

Tre (&" U s|cv) = Tre (Tr. (&') U s) 


in k. By the functoriality of Grothendieck’s trace map, Trcy = Trc o H!(Tr,) as 
maps H!(C’, @c//«) > k. Thus, it suffices to show that the map H!(C’, @cr/x%) > 
H!(C, @c/x) induced by Tr, carries &’ U s|c, to Tr,(&’) Us. We may view dualizing 
sheaves as subsheaves wc/x © Nx(Qe 4) and wer/K & Ne (Qe 74) in terms of which 
we have seen that the abstract trace map Tr, is induced by the natural inclusion 


Ver & Vek: 


To do the computation we work with Cech theory. Let {U,,} be an ordered finite 
open affine cover of C and let U) = U, NC’, so {U/} is an open affine cover of 
C’. The cohomology class s corresponds to a Cech 1-cocycle {5y i }n<m With Sym € 
Oc (Un NO Um), so s’ corresponds to {s7, ,,} with s) ,, = Snmlusnu;. Identifying & 


nm nm 


with an element of erie &’Us|c: € H'(C’, wc’/~) corresponds to {S), m& Jn<m and 
Tr,(&’) Us € H'(C, @c/x) corresponds to {8n,m&}n<m, where &’ is viewed in Vek 
in the natural way. The product sy,’ at the generic points of U;, AQ Uy vanishes at 
generic points not in C’, so the required equality is clear even at the level of Cech 
1-cocycles. Oo 
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Abstract We give a new proof of the Siegel-Shidlovsky theorem, which is based 
on a new version of Shidlovsky’s lemma and on M. Laurent’s interpolation 
determinants. We also establish a dual version of the lemma, and yet another proof 
of the theorem when the monodromy around 0 is trivial (as in the Lindemann— 
Weierstrass case). 
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1 Introduction 


Dans cet article, nous dirons a la suite de Serge Lang [L2] qu’une fonction entiére 
holonome f est une E-fonction si les coefficients de son développement de Taylor 
a Vorigine )>0dn - engendrent un corps de nombres K ;, et si la hauteur du point 
(do,...,d,) de P, (K ¢) croit au plus géométriquement avec 1. Nous n’aborderons 
pas la question, toujours ouverte, de comparer cette notion a la définition initiale de 
Siegel (voir 4 ce propos [R] et [A2.I]). Etant donné un corps de nombres K, nous 
dirons avec Shidlovsky [Sh] que f est une K E-fonction si, de plus, K¢ C K. Ces 
définitions s’étendent sans difficulté aux solutions entiéres de systemes différentiels 
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linéaires, et permettent, a l’aide des résultats de [A2.I], de parler de K E-vecteurs 
horizontaux d’un fibré 4 connexion sur P;/K. 

Soient n > 0 un entier, K un corps de nombres, de degré « sur Q, |.|oo la valeur 
absolue attachée 4 un plongement complexe de K (sous-entendu dans les extensions 
des scalaires de K 4 C qui suivent), M un fibré de rang n sur P; /K, muni d’une 
connexion a singularités V, et S la réunion des points 0,00 et de l’ensemble des 
singularités de V. On suppose que le point y = 1 de P;(K) n’appartient pas a S, et 
on note M, la fibre de M en I, et M, le complété formel en | du faisceau associé 
a M. Soit par ailleurs € € M, ®x C une section horizontale formelle de Vc au 
voisinage de 1. On note (€) le rang du plus petit sous-fibré de Mc dont € soit une 
section locale, et 7; (€) la dimension du plus petit K-sous-espace vectoriel W; de M; 
dont €(1) soit un point complexe. En d’ autres termes, les coordonnées €,..., €, de 
E relativement a un repere de M au-dessus d’un voisinage affine du point 1 forment 
une solution d’un systéme différentiel 


Ej Ej 
|: | =4@] : |.4@ € gh.(K@O K[k— I): G) 
(om En 


elles engendrent dans C((z — 1)) un C(z)-espace vectoriel de dimension n(€), et 
leurs valeurs en 1 engendrent dans C un K-espace vectoriel de dimension r := 
r\(€). Dans ces conditions, on a: 


Théoréme 1 (Théoréme de Siegel—Shidlovsky). on suppose que E s’étend par 
prolongement analytique en 0 en un K E-vecteur. Alors, r\(E) = me) 


Par passage aux puissances symétriques, on en déduit de la fagon habituelle I’ égalité 
des degrés de transcendance de C(z)(€),..., E,) sur C(z), et de Q(E; (1), ...,En 1) 
sur Q. 

Le théoréme de Siegel-Shidlovsky généralise le classique théoréme de 
Lindemann—Weiestrass sur les valeurs de la fonction exponentielle en des points 
algébriques. Il a connu un regain d’intérét ces derniéres années, a la suite de la 
preuve adélique que Bézivin et Robba [BR] ont donnée de son avatar originel, 
puis de son extension au cas général par Y. André [A2.I]] et, tout récemment, du 
raffinement suivant, qu’ avait conjecturé Serge Lang ([L2], p.100), et que F. Beukers 
a déduit de la preuve de [A2]: on peut, dans la conclusion du théoréme 1, supprimer 
le facteur 1/x, de sorte que r;(€) est en fait égal 4 sa valeur maximale n(E), méme 
sik £Q. 

Dans cet article, nous donnons une nouvelle preuve du théoréme | lui-méme, 
au moyen des déterminants d’interpolation de M. Laurent. A partir de paramétres 
L, To, T; dont la signification est précisée au §3 ci-dessous, il s’agit donc. 


(i) de construire un “morphisme d’évaluation” ¢ de l’espace I"(L) des sections 
d’un fibré sur P, /K, a valeur dans un K-espace vectoriel E'v(To, 71); 

(11) de montrer (lemme de zéros; voir §2) que ¢ est injectif dés que les dimensions 
de ces espaces le permettent (a une constante pres), c’est-a-dire ici dés que 
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To + rT; —nL >> 0, d’ot en prenant des bases, un déterminant mineur 
A € K,d’ordre ~ nL, non nul; 

(iii) et de majorer les différentes valeurs absolues de A, avec un soin particulier 
(méthode de M. Laurent) pour |A|.9. La formule du produit, et un choix 
convenable de L, Ty et T;, fournissent alors I’ inégalité recherchée. 


Une construction de ce type a déja été proposée par A. Sert [S] pour le théoréme 
de Lindemann—Weierstrass. Mais sa construction,“duale” de la nétre (les réles des 
paramétres nL et Ty + r7\ y sont inversés), s’appuie sur un critére de surjectivité 
pour ¢. Nous y revenons au §6 de I’article. 

Le point (iii) de notre nouvelle preuve fait I’ objet du §4. A ce propos, rappelons, 
selon J-B. Bost [Bo], que tout l’art d’une preuve de transcendance consiste 4 choisir 
judicieusement des métriques sur les espaces source et but de l’application ¢, ou 
de fagon moins canonique, des bases de ces espaces. Nous proposons au §5 de 
l'article une base de I'(L) différente de celle du §4, qui, jointe au théoréme de 
pureté d’ André [A 2.I], devrait fournir encore une autre preuve du théoréme 1, 
mais que je ne sais faire aboutir pour l’instant que dans le cas du théoréme de 
Lindemann—Weierstrass!. Ainsi, la preuve du §5 suit mot 4 mot le schéma précédent; 
seule varie la technique d’évaluation de la hauteur de A au point (iii). Et si le coeur 
lui en dit, le lecteur pourra déduire du §6 des variations de ces démonstrations, en 
remplagant simplement la condition d’injectivité de @ au point (ii) par la condition 
de surjectivité nL — (Ty +rT\) > 0. 

On peut finalement voir ce travail comme un pont jeté entre, d’une part, la preuve 
usuelle [L2], [Sh] du théoréme 1, qui démarre par une construction auxiliaire au 
point 0, et dont, a la fin du §4, nous reprenons le choix de paramétres Ty >> 7} 
pour conclure, et d’autre part, la preuve d’Y. André [A2], qui démarre par une 
construction auxiliaire au point 1, et dont, a la fin du §5, nous reprendrons le choix 
de paramétres 7; >> Ty. Dans chacune des preuves présentées ici, les constructions 
sont au contraire globales, et les points 0 et 1 jouent des réles paralléles, en reflet 
des outils sur lesquels elles reposent, a savoir: 


¢ une version du lemme de zéros de Shidlovsky relative a plusieurs points: voir le 
§2, ainsi que l’énoncé dual (lemme d’annulation) du §6; 

¢ le lemme d’interpolation de M. Laurent, énoncé au cours du §3, qui tient 
simultanément compte des points 0 et 1, et fournit aux deux preuves la méme 
majoration de |A|oo. 


2 Lelemme de zéros 


Le lemme de zéros dont nous aurons besoin se déduit de la relation de Fuchs 
généralisée (pour une démonstration plus proche de celle du lemme de Shidlovsky, 
voir [B2], §4.1). Nous allons en donner une formulation générale, dans un style 


'Ce texte est une version allégée de [B2], ot sont discutées les difficultés liées 4 son extension au 
cas général. 
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familier en théorie de Baker. Avec les notations de |’ introduction, soient L un entier 
> 0, et M*(L) le tordu par O(L) du fibré 4 connexion dual de M. Il est encore 
muni d’une connexion a singularités, qu’on note V*; pour M de type (a),...,4n); 
ou les a; sont des entiers < L, ses sections globales s’interprétent comme des n- 
uplets s = (s,,...,S,) de polynémes de degré < L—a),..., L —dy, a coefficients 
dans K, et Vidz) est représenté par d/dz(s1,...,5n) + (81,.--,8,)A. 

Soit R un ensemble de points de P)(K) a distance finie, éventuellement 
singuliers pour V. Pour tout a € FR, soit Wy un K-sous-espace vectoriel de 
Ma formé de sections horizontales pour V, c’est-a-dire, dans un repére local, de 
solutions dans K”[[z — a@]] du systéme différentiel (*); sia ¢ S, cela revient a se 
donner un sous-espace vectoriel W, de l’espace des “conditions initiales” M,. On 
dit qu’une section s de M*(L) s’annule @ un ordre = T le long de We si pour tout 
ZeE Wa la série formelle s.Z est divisible par (z — a)! c’est-a-dire si son image 
dans Oy [7] := Ox /(Z- a)’ Og est nulle. 


Théoréme 2 (Lemme de zéros). I/ existe un nombre réel c(V), effectivement 
calculable en fonction de M,V et card (R), vérifiant la propriété suivante. Soient 
{Tq,a € R; L} une collection d’entiers = 0, et s une section non nulle de M*(L) 
s’annulant, pour tout a € R, a un ordre = Ty le long de Wy. Il existe un sous-fibré 
M’' de M dont la restriction hors de S est stable sous V, sur lequel s s’annule, et 
tel que 

Yo dim(Wa/Wa OM),).Te <rk(M/M’).L + c(V). 

aEeR 


[Lorsque a ¢ S, on pourra remplacer le quotient du terme de gauche par W,,/ Wy N 
Mi] 


Démonstration. Soient M le K(z)-vectoriel a connexion défini par M, D I’ opérateur 
Va/dz sur M, M' le plus grand sous-espace vectoriel de M stable sous D et contenu 
dans I’hyperplan Ker(s) de M, M le quotient M/M’, muni de la connexion quotient 
D, et v sa dimension. Montrons que le sous-fibré M’ de M de fibre générique M’, 
sur lequel s s’annule, répond 4 la question. 


Par passage au quotient, s définit une forme linéaire 5 sur M, qui est un vecteur 
cyclique pour la connexion duale D sur M: en effet, l’orthogonal dans M du 
K(z) [D "]-module engendré par 5 est stable sous D, et contenu dans Ker(5), donc 
nul par maximalité de M’. L’annulateur dans K(z)[d/dz| de 5 est donc un opérateur 
différentiel £ = P(d/dz) de rang v = rk(M/M’). 

Pour tout vecteur horizontal Z € Wy de M, s.Z est solution de |’ équation 
différentielle £(s.Z) = (P(D*)(s)).Z = 0. De plus, s.Z = 0 si et seulement si s 
annule l’orbite de Z sous |’action du groupe de Galois différentiel d’une extension 
de Picard—Vessiot Fy/C(z) de V en a. Comme cette orbite engendre |’espace des 
vecteurs horizontaux d’un K(z)-sous-espace vectoriel Mz de M stable sous D, 
on déduit de la maximalité de M ’ que l'image de Mz dans M est nulle, et la 
dimension /tg de l'image de Wa sous s est égale a dim /Wa 1 M'(F,)), soit 
Ma = dim(Wy/We A M),). 


Le théoréme de Siegel-Shidlovsky revisité 55 


Pour tout a € 7, l’équation différentielle £y = 0 admet ainsi dans K [[z—«]] au 
MOINS /Lq Solutions linéairement indépendantes d’ ordre > T,. La collection de ses v 
exposants en @ est donc formée de jug entiers > Ty, et de v—4y nombres complexes 
dont les parties réelles sont minorables en fonction de A (et > 0 sia € S). De 
méme, les parties réelles de ses exposants a l’infini sont, a l’addition prés d’une 
constante ne dépendant que du type de M/ et de A, toutes minorées par —L. Enfin, 
les parties réelles de ses exposants aux autres singularités 0 de £ sont > Osia ¢ S, 
et minorées en fonction de A sinon. La relation de Fuchs généralisée (voir [B1], 
Thm. 2) entraine donc 


—vL + waER La Ty < c(V), 


ou c(V) désigne la somme des opposés des constantes énumérées plus haut, de 
Virrégularité globale de End(V), et de |R|n(n — 1), et le théoréme 2 est démontré. 
Nous nous restreignons désormais au cas ot R = {0, 1}, et ot dim(W) = 1, 
et nous posons r = dim(). Pour la preuve du §4, on pourrait suivre l’usage, 
courant en transcendance, qui consiste a éviter les passages aux quotients auxquels 
conduisent naturellement les lemmes de zéros grace a des contraintes numériques 
sur les paramétres (ici, L > TT). Mais cette contrainte n’est pas satisfaite dans la 
preuve du §5, ot la notion suivante s’avére utile: on dit que W est non dégénéré 
si pour tout sous-fibré M’ # M génériquement stable sous V, les quantités n’ = 


rk(M/M’') etr’ = dim(W, /W, a M') vérifient: 


a dim(W, /W nM!) = dim(W1) _ 
yo rk(M/M’) — rk(M) on 


(#*) 


Pour a = 0, ot dim(Wo) = 1, la condition de non-dégénérescence correspondante 
revient a demander que Wp ne soit pas inclus dans un sous-fibré 4 connexion propre 
de M. En reprenant l’expression c(V) du théoréme 2, on en déduit : 


Proposition 1 (Corollaire du lemme de zéros). Soient {To, T,, L} trois entiers > 0, 
et § une section de M*(L) s’annulant, pour a = 0,1, aun ordre = Ty le long de 
We. On suppose que la droite Wo n’est pas incluse dans un sous-fibré a connexion 
propre de M, et que l'une ou l’autre des hypothéses suivantes est satisfaite : 


(i) To trl; >nL + c(V) et L >T\; 
(ii) To + rT, > nL +nc(V) et W, est non dégénéré. 


Alors, s = 0. 


Démonstration. Dans le cas contraire, il existerait un sous-fibré M’ vérifiant Ja 
conclusion du théoréme 2. Mais c’est impossible, puisque l’hypothése faite sur Wo 
entraine Wy M Mi, =0 (comme M’ C Ker(s), il aurait d’ailleurs suffi d’imposer 
s(W) # 0), et qu’avec les notations n’ Vr introduites ci-dessus, on peut minorer, 
dans sa conclusion, l’expression dim(Wy/Wo Mb6).To + dim(W, /W1.9 M').T; 
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e dans le cas (i), par 


To +rT,—dim(W,NM)).T, = To+rT—rk(M’).T; > nL+c(V)—rk(M’).L, 


e dans le cas (11), par 
F 1 ir 1 1 r / é 
Ol let hee tig aly | ee ae ey), 
1 n n 


et donc toujours strictement par rk(M/M’)L + c(V). 


3 Les deux premieres étapes, et le lemme d’interpolation 
de M. Laurent 


Nous pouvons maintenant préciser les deux premiéres étapes de la preuve du 
théoréme 1. Nous reprenons les notations M, V,S,€,n = rk(M),n(E€),r(E), kK = 
[K : Q] de l’introduction. Le point 0 est en général une singularité de V, mais par 
hypothése, la section horizontale € y admet un prolongement analytique en un K E- 
vecteur, qui appartient donc a Mo. Nous choisissons pour Wo le K -sous-espace 
vectoriel de dimension 1 que € y engendre. Au point 1 ¢ S, nous notons W, le 
K-sous-espace vectoriel de dimension r := r;(€) de M, engendré par €(1), et nous 
choisissons pour W le sous-espace horizontal de My, correspondant a Wj. 

Deux réductions nous seront utiles. Tout d’abord, on peut sans perte de généralité 
supposer que les composantes €),...,€, de la solution étudiée sont linéairement 
indépendantes sur K(z), c’est-a-dire que n(€) = n ; en effet (cf. [Sh], 4, lemme 2), 
le saturé M du fibré a connexion que € engendre dans M/ n’a pas de singularité au 
point y = 1 ¢ S. Appliqué a ce saturé, dont le rang 7 est égal 4 n(E), la preuve 
aboutit a l’inégalité r)(€) > a, qui est bien la conclusion recherchée. (En d’ autres 
termes, on peut supposer que Wo est non dégénéré.) 

La seconde réduction ne s’avére nécessaire que pour la preuve du §5, ot la 
contrainte numérique L > 7; du §4 n’est pas vérifiée, mais par souci de symétrie, 
nous la ferons dans les deux cas. Sans perte de généralité, on peut également 
supposer que VY; est non dégénéré. En effet, si ce n’est pas le cas, il existe un 
sous-fibré M’ # M _ génériquement stable sous V tel qu’avec les notations de 
(x), a < =, et on peut choisir M’ de telle sorte qu’il réalise le minimum 
de ces expressions. Considérons alors le fibré quotient M = M/M’, _qui est 


muni de la connexion a singularités quotient V, et relativement auquel Wi := 
W/ Win M' est non dégénéré: en effet, ses sous-fibrés sont de la forme ick 
avec M’ Cc - Ker(s), et les quantités n” := rk(M/MY”) = ee ess =n" et 


r= dim, /W, AM") = = dim, /W, M') := =r” vérifient : == =f se 


— n’ 


par minimalité de ce dernier quotient. On vérifie de plus, en coiieatt en une base 
de M*(P;—oc) un systéme primitif d’équations de M’, que V n’a pas de singularité 
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en 1, et que l’image € de € dans M est encore un K E-vecteur en 0. Le théoréme 


i applique aM et E et au sous-espace non dégénéré Wy, fournit la conclusion 
K> a Comme 4 = > = ona bienr > os 

Ces réductions étant Aoquises. soient c le plus petit entier supérieur 4 nc(V), et 
To, T;, L trois entiers tels que 


To +rT; =nL+c ( > >) 


Dans ces conditions, 
(i) la premiére étape de la preuve consiste a choisir: 


* pour I(L) l’espace des sections de M*(L), qui, pour L assez grand, est de 
dimension h°(L) = n(L + 1) — deg(M) ~ nL; 

* pour Ev(7To,7;) Vespace vectoriel Homx (Wo, OpolTol) ® Homx mM, 
0; [7;]), qui est de dimension dim(W). To + dim(W)). T, =To+1rTh; 

* pour ¢ l’application K-linéaire de '(L) dans Ev(To, 7,) somme directe 
sura = 0,1, des composées des applications canoniques T'(L) > M?, 
M*q > Hom(Wy, Og ), OO [71]. Pour s dans I'(L), $(s) représente 
donc la collection des 7 premiers coefficients de Taylor en 0 de s(€) et des 
T, premiers coefficients de Taylor en 1 de s(Z), ot Z parcourt W;. On 
trouvera plus bas une expression matricielle de @: une matrice ® a 7; + rTo 
lignes et h°(L) ~ nL colonnes. 


(ii) la seconde étape consiste a remarquer que puisque 7p) + r7; = nL +c > 
nL + nc(V), le deuxiéme cas de la proposition 1, joint aux hypothéses 
n = n(E) et W non dégénéré auxquelles nous nous sommes ramenés, entraine 
que pour s € I(L), $(s) ne peut s’annuler que si s = 0, autrement dit 
que @ est injective. L'une des coordonnées de A’) dans une base de 
A) Hom(I(L), Ev(T;;To)) (c’est-a-dire l'un des mineurs d’ordre h°(L) 
de ®) est ainsi un élément A de K non nul. Reste 

(iii) a évaluer la hauteur de A, c’est-a-dire ses valeurs absolues pour les différentes 
places v € Vx de K. 


Pour le facteur local en la place oo, cette derniére étape repose sur |’énoncé 
suivant, dont on trouvera diverses variantes dans [La], [S], Lemme 4.4.1, [LP], 
Lemme 7, [H]. 


Théoréme 3 (Lemme d’interpolation de M. Laurent). Soient D un disque de C 
centré a l’origine, yo, y; deux points de D, To, 7, deux ensembles d’entiers > 0 
et majorés respectivement par Ty — 1, T; — 1, h la somme des cardinaux de ‘Jo et de 
Ti, et fi,..., fn des fonctions analytiques sur D. Posons € = Ty + T; —h. Alors, la 


fonction 6(z) = det ( L)2)) eq,1<i<h 


s’annule en 0 avec une multiplicité 
Pe 
219T; =eUo-+ 1). 
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Démonstration. En effectuant des combinaisons linéaires des colonnes, on peut 
supposer que f(z) € z’ =10)4. Lordre de 6 en 0 est alors minoré par ¥=1,..,(i — 


1) - Stent — Sent = ER — BEY — AGED = TT, — (To + 7) + SY. 


Appliqué aux points yo = 0 et y; = 1 (dont on notera qu’ils interviennent 
ici sur un pied d’égalité), cet énoncé fournit, tant au §4 qu’au §5, une majoration 
trés précise des valeurs absolues en co de mineurs d’ordre ~ Jp + T, de A. 
Avant de débuter cette estimation et celles des |A|,,v € Vx, je signale pour la 
commodité du lecteur que les paramétres 7p, 7; seront en fin de preuve choisis de 
facon proportionnelle a L, et que les termes dominants des majorations sont de 
la forme e?7/"Lesé, C’est y qu’il convient de contréler précisément. D’ autre part, 
je désignerai par co,c1,... des entiers > 2 qui ne dépendent que de M,V,€, 
et que l’on pourrait d’ailleurs majorer de fagon effective en termes de la matrice 
A(z) et de €. 


4 Conclusion de la premiére preuve (cas général) 


Matriciellement, ¢ s’exprime comme suit: fixons une base (81,... , 5,0(z)) de P(L), 
et considérons la base de l’ espace but de ¢ construite a partir d’une base Z),..., Z; 
de Wi, de la base € de Wo, et de la base de I’ espace Oo, [Tx], @ = 0, 1 formée des 
mondmes {4 (z—a)' ;t =0,..., 7, — 1}. Alors, ¢ est représentée par la matrice a 
Ty + rT, lignes et h°(L) colonnes & coefficients dans K: 


Do Py = (0'(5;-E)(0)) o<t<t)—1:1<i<h(L) 
= 


’ 


@, = (0'(s;.2,)0)) rer, —iaerenneny 
®, tae (p=1.....7) 


ou 0 désigne le champ de vecteurs d/dz. 

Du cété de l’espace source, identifions le fibré M 4 O(a) ®---. ® O(an), 
choisissons un repére adapté (e),...,@,) de sections de M au-dessus de P, \ oo, 
dott l’écriture EE = L_E,e, = "(E\,...,En), Zp = Li) Zp.e, pour p = 1,...,7, 
et une décomposition [(L) = @,=1.,...n [(O(L —a;))e;*. Pour tout L’, choisissons 
pour base de I’(O(L’)) les monémes {rz ;€=0,...,L’} autre base fera 
Vobjet du §5). Dans la base {acer =1,...,n,€=0,. —a,}de I(L) = 
H°(M ® O(L)) indexant ses h°(L) = n(L + 1)— Da, a notre matrice 
se réécrit: 


Do =('(4 ge, )(0)) 0<t<To—13;1<i<n,0<l<L—a, 


PD, = (0' (42 Zp.)(1)) 0<t<T\—13;1<i<n,0<l<L—a, 
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Quitte a remplacer L par L + co dans les estimations qui suivent, on peut, comme 
je vais le faire, supposer les a, tous nuls (fibré trivial). Soit alors A l’un des 
déterminants mineurs d’ ordre maximal Mo(L) = n(L + 1) de @ non nul. 

(A) Majoration de |A|, (v € Vx quelconque) 

Elle repose sur une majoration terme a terme des valeurs absolues v-adiques des 
coefficients de A. 

Al. Coefficients des lignes de A situées dans ®o: les prolongements analytiques 
a lVorigine €,(z) = Deroain sont par hypothése des K E-séries. Comme le 
coefficient 


t 


a'(G£)0) = ( ‘) aie = Sit >, 
et s’annule si ¢ < £, sa valeur absolue v-adique est majorée par |d7,-;|7! si v est 
finie, par ee si v est archimédienne. Ici, d7,-; désigne un dénominateur commun 
des a,x, k = 0,...., 7 — 1, qu’on majore suivant la définition de Lang par oP 
(Le produit c;c est relié a la taille des transformées de Laplace des séries E,(z).) 
A2. Coefficients des lignes de A situées dans les ®,,0 = 1,...,r: pour tout 
s € I(L) et tout Z horizontal, 0‘(s.Z) = ((V#)'s).Z. Comme | est un point 
ordinaire, on en déduit que si v est une place finie, le coefficient 0° ne Zou) a 
une valeur absolue v-adique < |L!|>!|c3|7"|cs|, ; si v est archimédienne, elle est 
< iT, tc]! co. 2(Le produit c3c4 est relié a la taille du “transformé de Fourier” de V ; 
le produit c5c¢ a la hauteur de W,.) 

A est somme de /o(L)! monémes ot apparaissent les coefficients du premier 


type au plus To fois, du deuxiéme type au plus r7, fois. Ainsi 


T] lAly S$ C@@ + 1)! (ere2)™ (eseay'™ (ese6)"™ (LITT 


veEVK ,v#oo 


Le terme prépondérant de cette majoration est donc (L!7;!)""*, dont le logarithme 
croit comme «(L + 7;)T\LogL. 

(B) Majoration de |A| go (méthode de M. Laurent) 

Comme on l’a dit, elle repose sur le théoreme 3. Quelques préparatifs avant de 
V appliquer a certains mineurs (d’ordre h ~ Ty + T;) de A. Tout d’abord, € € M, ® 
C étant une combinaison linéaire a coefficients complexes A, Z) +---+A,Z,, avec, 
disons, A; 4 0, on ne change guére A quand on remplace dans @ le bloc , par la 
combinaison linéaire correspondante ®; = A;®, + --- + A,@,. Plus précisément, 
A se déduit de ® en rayant c’ := Ty + rT; — h°(L) = c —n lignes (cf. (* * *)). 
Par conséquent, les indices t d’au moins 7; — c’ lignes de ®, intervenant dans A 


?Tl est possible que le théoréme de pureté d’ André permette de se débarrasser des termes en 7)! 
dans ce calcul archimédien. Je reviens sur ce point au §5. Ici, le gain serait faible puisqu’on prendra 
L>T,. 
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sont également indices de lignes de @o, de @3,...et de &, intervenant dans 
A. Notons 7; l'ensemble, de cardinal t} > 7, — c’, de ces indices communs, et 
To l'ensemble, de cardinal t) > Tp —c’, des indices de ligne de ®p intervenant dans 
A. A division par +A}! prés, A est donc égal au déterminant A de la matrice carrée 
d’ordre h°(L) = n(L +1): 


2 Pp C Po Py = (0'(51-E)(0)) reteicicnL) 
P=(P CH]. | O = (0(-E)()) reqicna) 
Uc®@ Ww 


ot les lignes de W sont, pour au plus To — To d’entre elles, des lignes de ®o, et pour 
au plus rT; — tT d’entre elles, des lignes de ©), 2, ...., ou ®,. 

Suivant [S], nous majorons maintenant la valeur absolue du nombre complexe 
A en calculant son développement de Laplace suivant les mineurs d’ ordre maximal 
deW: 
B1. Valeur absolue en co des mineurs d’ordre maximal de W : la considération de 
leur format et les calculs précédents sur |A|,, avec v = oo, montrent que chacun 
d’eux est < (n(L+1)—1t-—T7)! ge (cee! T!)""-4 . Comme To — to < c’, 
ils sont donc majorés par (n(Z + 1) — (t + 1))!es (coc, ny 
B2. Valeur absolue en oo des mineurs d’ ordre maximal h := t) + t de la matrice 


(3) : ils sont tous de la forme 6(1), pour des fonctions 6(z) du type étudié au 
jeune d’interpolation : prendre yp = 0,y; = 1, ff = 5;.€, oli parcourt une 
partie 4 h éléments de l’ensemble [1,°(L)], et reprendre les notations ci-dessus 
pour 7o, 7 ; en particulier, h = Ty + T; — C, avec ¢ < 2c. En vertu du lemme de 
Schwarz, appliqué sur des disques de rayon | et R > 1, chacun d’eux admet une 
majoration de la forme 


[8] < ROPNP-M*MVALY (sup, cr, 19 iO)” (Sup, er, i 1 Fila)": 


La premiére parenthése se majore comme on |’a fait des coefficients de |A], en 
A1. Pour la seconde, noter que les E-fonctions €, sont des fonctions entiéres de 
type exponentiel. Si c7 > 1 désigne un majorant de leurs types, on déduit des 
inégalités de Cauchy |0’ filr < LI flor que pour fj = AE, etR = Th, 
é T2 4s 
ce dernier terme est majoré par (supp, PF-e771 7 < e*77, Ainsi |6(1)| < 
= Te 2 Sipe saa 

T, WP pag ene Ty 8 e*77 En définitive, la formule de Laplace donne, avec 
R= Ti : 


rT? 27? 
1 0 
Cy 


Ale 27," <A ie rr ea, 


Le terme prépondérant de cette majoration est donc pe (7,!)'", dont le loga- 
rithme croit comme (—7p + r7\)T, LogT}. 
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(C) Formule du produit et choix des paramétres 


Nous faisons maintenant croitre L indéfiniment, et supposons que To et 7), qui 
étaient jusqu’a présent soumis 4 la seule contrainte Ty + r7, = nL +c de (* * *), 
croissent linéairement avec L. Comme A # 0, la formule du produit entraine : 


ToT, < rkLT, + r(k + 1)T? + O(L?/LogL), 


donc a <rke+r(k + 14 + O( rear). Choisissant, a l’instar de Siegel et 
Shidlovsky, To = (n —€)L,T; = (€L + c)/r avec € petit, on obtient: n —e < 


rck(1 + O(e)), et finalement la conclusion n < rx souhaitée. 


5 Conclusion de la seconde preuve (cas de 
Lindemann—Weierstrass) 


Dans la preuve précédente, l’hypothése que € est un vecteur de E-fonctions est 
intervenue a deux reprises : au point Al pour majorer la hauteur des coefficients 
des séries entiéres s;.€ € K|[[z]] ; au point B2 pour majorer la croissance 4 |’infini 
des fonctions entiéres s;.€. Nous n’avons utilisé des sections horizontales Z, € W 
de V que la propriété de convergence au voisinage du point 1. Or le théoréme de 
pureté d’Y. André sur les séries Gevrey de type négatif ([A2.I], Théorémes 4.2 et 
4.3.iii) affirme que dés que n(€) = n, toutes les solutions de V relévent de la 
théorie des E-fonctions. Plus précisément (voir [A2.I], corollaire 4.4), V admet 
alors un systéme fondamental de solutions en 0 de la forme U(z)z©, ot e7/7© € 
GL,,(C) représente la monodromie de V au point 0, et U(z) est une matrice dont 
tous les coefficients sont des F-fonctions. Nous allons maintenant faire usage de 
cette propriété, en supposant, pour éviter la présence au voisinage de | de produits 
de E-fonctions par des G-fonctions, que la monodromie de V en 0 est triviale. 

Jointe 4 l’existence d’un E-vecteur solution de V, cette hypothése limite 
en pratique les applications aux cas couverts par le théoréme de Lindemann— 
Weiesrtrass. Elle permet en tous cas, par torsion par une puissance entiére de z, 
de supposer que V admet une base de solutions constituée de E-vecteurs, et qu’en 
particulier, toutes ses sections horizontales Z, € W admettent des prolongements 
analytiques en des fonctions entiéres de type exponentiel. 

Mais pour exploiter cette idée, encore faut-il contréler les hauteurs des coeffi- 
cients des séries entiéres s;.Z, € K[[z— 1]]. Ceci conduit (voir A’2 ci-dessous) a 
remplacer les mondmes z de la base de T'(O(L’)) = Sym“ (T(O(1)) choisie au 
début du §4 par {az —1)';£ =0,..., L’}, d’ou une nouvelle matrice 


Do Do = (8 (A(z — Y)*ENO)) o<rem-t;1<1<n0<t<L—a, 
D, 


Ge saebs 
®, = (a (4 6S 1)°Z,,)()) 0<t<T)—1;1<i<n,0<l<L—a, 


®,. ane (p=1.....r) 


représentant ¢. 
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Cette nouvelle matrice se déduit de celle du §4 par multiplication a droite 
par un élément de SLjo(,). En rayant les Ty + rT, — h°(L) mémes lignes que 
précédemment, on obtient donc le méme déterminant mineur A € K*. Mais de 
nouvelles estimations apparaissent pour | A|,: 

(A’) Majoration de |A\,, v € Vx quelconque 

A’1. Coefficients des lignes de A situées dans ®o: si v est une place finie, les 
factorielles ne se simplifient plus, et on doit multiplier le majorant trouvé en A1, 
par |L!|>'; celui des places archimédiennes devient Pe.” 

A’2. Coefficients des lignes de A situées dans les ®,,p = 1,...,r : on remarque 
t 
i 
ordinaire de V. On peut donc supprimer le facteur | L!|>! de la majoration de A2 aux 
places finies. De plus, on peut, aux places v infinies, en supprimer le facteur 7! : en 
effet, les fonctions induites par Z,, sur K, C C sont entiéres de type exponentiel; 
Si cg => 1 désigne un majorant de leurs types, les inégalités de Cauchy, appliquées a 
un disque de rayon 7;, entrainent que |8'((4(z — 1) Zp.) (|v < eM Tes", 

Le terme prépondérant du produit JT,2.9|A|, devient done (L Pee meh tolaee 
(B') Majoration de | A|oo (méthode de M. Laurent) 

On reproduit la démonstration du §4.B jusqu’a l’étude du développement de 
Laplace, qu’on modifie comme suit. 


maintenant que 0((7,(z — 1)'Z,,)(1) = ( ) a'~"Z, (1), et que 1 est un point 


B’1. Valeur absolue en oo des mineurs d’ordre maximal de W: pour les mémes 
raisons qu’en A’2, on peut supprimer le facteur (T;!)’" de la majoration de B1; 


B’2. Valeur absolue en oo des mineurs d’ordre maximal h := t + 1% de la 
. D : : ‘ : 
matrice (3) pas de changement par rapport 4 B2, si ce n’est que les fonctions 

1 


fi s’écrivent maintenant f; = az — 1)*€,. On peut alors majorer la premiére 
parenthése comme dans A’1, et la seconde en appliquant l’inégalité de Cauchy sur 
un disque de rayon R = T). 

Le terme prépondérant du majorant de |A|go devient donc ie a 
(C’) Formule du produit et choix des parametres 


La formule du produit donne cette fois 
ToT, < KTypL + O(L?/LogL), 


done 2 < k + O(qsar): Choisissant, 4 V’instar d’André [A211], 7; = 
C — €)L,T) = erL +c, on aboutit a la conclusion : < x souhaitée. 

Remarque 1. On voit en combinant les deux approches avant de fixer les 
paramétres que TTT < <K«(1 + o(1)). C’est bien en prenant r7\ 
(comme au §4) ou 7p (comme ic1) aussi petit que possible qu’on obtient le meilleur 
résultat. 

Remarque 2. Le choix de paramétres rT; ~ nL fait dans ce §5 évoque la méthode 
de Gel’ fond—Débes utilisée dans [A1] et [A2.II]. Yves André me signale d’ ailleurs 
que la démarche présentée ici permet de retrouver ses résultats d’indépendance 
linéaire [A1] sur les valeurs de G-fonctions. 
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6 L’approche duale 


Restreinte au cas de Lindemann—Weierstrass €,(z) = e“*,t = 1,...,n,lapreuve du 
§4 est “duale” de celle de [S], qui l’a fortement inspirée*. Plus précisément, A. Sert 
étudie la fonction e* aux points @),..., @,, et considére une matrice @ an (L +1) 
lignes et Ty + rT, colonnes, construite a partir d’une base {b,,...,b,} de W; dans 
M,, et des matrices 


A 1 
dy = (Galea) | 
£! 1<i<n,0<<L:0<t<T—1 


: _ 1 : 

b, = (« “ aleNa)) 
£! 1<i<n,0<l<L;0<t<T,-1 

(Pour passer de mes notations a celles de [S], utiliser le dictionnnaire: @ > 

M,L—> J,T) > S,T; ~ T,n ~ m,r > n.) Comme 


1 


1 1 1 
Goa) =a (F2e""] (0), e* a! eeY(a) = a ie) (), 


f! t! 

® est essentiellement la transposée de la matrice ® du §4. Mais malgré sa 
ressemblance avec celui du §2, le “lemme de zéros” utilisé par Sert ([S], Lemme 4.2) 
n’est pas de méme nature : il énonce I’existence d’une constante ¢ = n*/4 telle que 
(pour L > 7}), & est de rang maximal 7) + rT, dés quenL > Ty + rT; +¢. C’est 
un lemme d’interpolation au sens de D. Masser : une condition suffisante pour que 
le morphisme d’ évaluation ¢@ lui-méme (et non son transposé) soit surjectif. 

Comme me I’ ont fait remarquer M. Laurent et S. Fischler, les formules supra (du- 
alité de Fourier—Borel) montrent qu’un lemme d’ interpolation s’interpréte comme 
un lemme de zéros lorsqu’on se restreint a l’ensemble des exponentielles-polynémes, 
c’est-a-dire a des systémes différentiels 4 coefficients constants. Je me propose 
maintenant d’étendre I’ énoncé de surjectivité de [S] au cas d’un systéme différentiel 
général. Il est possible que la dualité que fournit la transformation de Fourier (voir 
[A2.I], $6) y conduise, mais j’établirai cette extension par un argument élémentaire, 
proche de ceux de Masser et de [F]. Pour des raisons expliquées plus bas, je préfére 
d’ailleurs appeler “lemme d’annulation” cette généralisation. 

Pour alléger, je ne traiterai que de la situation rencontrée a la Proposition | du §2, 
et en supposant le fibré M trivial. On reprend les notations n=rk(M), V,R, We 
(a € R),c(V) du §2, avec donc R = {0, 1} et dim(Wo) = 1. On pose 
r = dim()), et on note € (resp.{Z),..., Z,}) une base Wo (resp. W,) sur K. 
Outre les hypothéses de non-dégénérescence faites 4 la Proposition 1, on devra 


3C’est d’ailleurs une mesure d’indépendance algébrique entiérement explicite que [S] obtient dans 
ce cas. Dans cet ordre d’idées, je ne sais si la méthode du §4 permettra d’établir, 4 l’instar de [L1], 
une version quantitative du théoréme 1. 
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ici supposer que €(0) 4 0, et que 1 est un point ordinaire de V, de sorte que 
{Z\(1),..., Z,(1)} forme une base de W,. 


Proposition 2 (Lemme d’annulation). I] existe une constante ¢(V) effectivement 
calculable en fonction de M et de V et vérifiant la propriété suivante. Soient 
To, T,, L un triplet d’entiers > 0 et {a9,,0 < t < To-—l,ajy,1 <i < 7,0 < 
t < T; — 1} un (To + rT))-uple d’éléments de K. Supposons que la droite Wo ne 
soit pas incluse dans un sous-fibré a connexion propre de M, et que E(0) # 0. 
Supposons par ailleurs que | soit un point ordinaire de V, et que l’une ou l'autre 
des hypothéses suivantes soit satisfaite: 


(i) L}=T; 
(ii) W, est non dégénéré. 


Supposons enfin quenL > Tyo + rT, + €(V). Alors, il existe une section s de 
M*(L) telle que 0'(s.€)(0) = ao, pour tout t < Ty — 1 et 0(s.Z;)(1) = aj pour 
touti = 1,...,7,¢<7T,—-1. 

Démonstration. Il s’agit de montrer que sous ces hypothéses, les Ty + rT, formes 
linéaires sur '(L) := H°(M*(L)) définies par 


evor(s) = 0'(s.E)(0)(t = 0,...,T) — 1); 
eviz(s) = 0'(8.Z;)A)G = 1,...,r3t =0,...,7,—1) 


sont linéairement indépendantes sur K (c’est-a-dire que les Ty + rT) lignes de la 
matrice ® du §4 le sont; le lemme de zéros du §2 exprimait, pour Ty + rT; > 
nL +c(V), Pindépendance linéaire de ses n(L + 1) colonnes). Fixons une matrice 
représentative A(z) de la connection V, dont on suppose que le dénominateur Q(z) 
ne s’annule pas en 1, et est d’ordre 6 minimal en 0, et notons deg(V) le maximum 
des degrés de Q et des coefficients de QA. 

Supposons tout d’abord que 7; = 0, et que 0 n’est pas une singularité de 
V (autrement dit, que Q ne s’annule pas en 0; la condition evo9 4 0 est alors 
automatique), et construisons un nombre réel cj(V) tel que pour tout L et tout 
To < nb — co(V), les éléments evo, := ev,;0 < t < To — 1 de I'(L)* sont 
linéairement indépendants sur K. 

Soit t) > 1 le plus grand entier tel que les ev,;0 < t < t)—1 soient linéairement 
indépendants. Si un élément s de /*(L) s’annule a l’ordre to (sous entendu : le long 
de Wo), il s’annulera alors al’ ordre t) + 1. Soit L’ tel que t]) +n > n(L’ +1) => t. 
Par algébre linéaire, il existe o € [(L’) non nul annulant les ev,,t < t) — 1. Pour 
tout k tel que L’ + kdeg(V) < L, (Voay" (o) est dans '(L). Notons k, la partie 
entiére de (L — L’)/ deg(V). 

Comme Q(0) 4 0, on déduit de la relation Vs,(V6,5).€ = Qd(s.€) et de 
Vhypothése sur ev,, que o s’annule a l’ordre t) + k;. Le lemme de zéros donne 
to +k, <nL'+c(V). Ainsi,k, < c(V), et |) > nL’ > n(L—(c(V) + 1)deg(V)). 
Pour To < nL — c4(V) avec c)(V) = n(c(V) + 1) deg(V), on peut done affirmer 
que les ev;;0 < t < Ty — 1 sont linéairement indépendants. 
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Si 0 est une singularité de V, on modifie l’argument de la fagon suivante, qui 
revient A tordre le fibré M par O(—cé). Soit OQ = z°Qy, avec O,(0) F 0, et 
soit c un entier > 2c(V). Pour t — cO ~ nL’, il existe o € I(L’) s’annulant 
a l’ordre t) — cO le long de Wp. Alors, 6 := 28a € I(L’ + c@) s’annule a 
Vordre Tt. Pour tout k < c tel que L’ + cO + kdeg(V) < L, (V5,a)* @) est dans 
I(L). Il en découle comme supra que 6 s’annule a l’ordre t) + k, o a l’ordre 
t —cO+k, donc |] —cO +k <nL' + c(V). On en déduit en notant k, la partie 
entiére de (L — L’ — c6)/deg(V) que inf(c,k,) < c(V), donc k; < c(V). Ainsi, 
tT >nL' +c >nL —(n—1)c6 — ch (V), et cf (V) = 2ncg(V)O convient. 

Montrons maintenant que si rT; < nL — c}\(V) avec c}(V) = 3nc(V)deg(V), 
les evj,;,i = 1,...,7,0 < t < T; —1 sont linéairement indépendants. Soit t; > 1 
le plus grand entier tel que tous les ev;,;,i = 1,...,7,f < tT, sont indépendants, 
tandis que (disons) ev;,,, en dépende. Soit encore c > 2c(V) et soit L’ tel que 
rt +(r— 1c ~ nL’. Par algébre linéaire, il existe o € (L’) non nul s’annulant 
a lordre t; + c le long des Z;,i = 2,...,r, et 4 ’ordre tT, le long de Z. Par 
l’argument supra, o s’annule a l’ordre t, + k le long de tous les Z; tant que k < c 
et que L'+kdeg(V) < L, et celaimpose sous ces contraintes que r(t; +k) <nL/+ 
c(V) <rmt(r—1)c+c(V), V@otk < (l1-+)c <c. Ainsi, [As] < (2—+)c(V) 
etrt) >nL—c{(V). 

Le cas général enfin: on va vérifier que la constante ¢(V) := sup(c} (V), co(V)) 

répond a la question. Si Ty +rT, < nL—€(V), onaen particulier rT; < nL—c}(V), 
donc les rT; évaluations de I"(L) le long de VW sont linéairement indépendantes. 
Soit To le plus grand entier tel que les evo, t < tT) — 1 les complétent en un systéme 
libre. Supposons pour simplifier que 0 n’est pas une singularité (suivre le 2e pas pour 
le cas général). Pour c = 2c(V) et L’ tel que t) + r(7, +c) ~ nL’, on construit 
o € I(L’) s’annulant a l’ordre T; + c le long de Wi, a l’ordre to le long de Wo. 
D’ou pour toutk <c tel que L’+k deg(V) < L,m+k+r(T, +c) <nL’+c(V), 
imposant k < c(V), ett + rT; > nL — cj (V). 
Remarque 3. Je préfére donner au lemme la dénomination “‘annulation” (vanishing 
lemma) plutét qu’ “interpolation”, d’abord pour éviter une confusion avec le lemme 
d’interpolation du §3, mais surtout parce qu’il exprime l’annulation d’un H!. Plus 
précisément, soit 77,7, le sous-faisceau de M* des germes de sections s’annulant 4 
un ordre > Tp le long de Wo, aun ordre > 7} le long de Wi. Supposons M trivial, 
de sorte que H'(M*(L)) = 0 dés que L > 0. La suite exacte0 > Jnr, > 
M* + M*/I1.7, — 0 fournit aprés tensorisation par O(L) une suite exacte 


H°(Fny.7,(L)) > HO(M*(L)) > H°(M*/In.7,)(L))> A (In,7 (L))9, 


ou l’on reconnait en deuxiéme terme J"(L), en troisigme l’espace Ev(To, 71) ~ 
C’ @C™ (ce toujours sous l’hypothése €(0) # 0, qui signifie que Mo/Oo @Wo 
est sans torsion — condition automatiquement satisfaite au point ordinaire 1), et en 
deuxieme fléche notre application ¢. 

Un fibré de degré négatif ne peut étre engendré génériquement par ses sections 
globales. En itérant l’action de la connexion sur les sections de Trp (Lb! ), on en 
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déduit des preuves plus naturelles du lemme de zéros et du lemme d’annulation. 
Par exemple, dans le cas élémentaire ot V est irréductible et avec c(V) = 
(2deg(V) + 1)n? = é(V): 


— le fibré F~~-on,7,-n (L +ndeg(V)) est de degré < nL —Ty—rT, +c(V); donc si 
cette expression est négative, H°(77,,7, (L)) doit s’annuler: de facon équivalente, 
¢ est injective; 

— dés quenL —T)—rT, > €(V), le fibré Fn, 4.6.7, 4n(L —ndeg(V)) a une section 
non nulle, donc J7,7,(L) est engendré génériquement par son H°, et son H! 
s’annule: de fagon équivalente, ¢ est surjective. 


Méme dans le cas général, la situation est plus simple que pour les groupes 
algébriques [F]: si nos fibres sont de dimension n plutot que 1, notre base, de 
dimension 1, ne nécessite pas de recourir a la théorie de Il’intersection; elle devrait 
de plus permettre de passer par dualité de Serre de l’un a l’autre des énoncés. 


Remarque 4 (en guise de conclusion). Le fait que le théoréme de Siegel—Shidlovsky 
porte sur des connexions a singularités irréguliéres a limité son impact en géomeétrie 
algébrique. Etait-ce dans cette direction qu’allait la suggestion proposée par P. 
Cartier 4 S. Lang, et dont personne ne se souvient (voir [W], §1) ? On peut en tous 
cas espérer que I’ ébauche esquissée par Deligne d’une théorie de Hodge irréguliére 
lui fournira de nouveaux champs d’ applications. Ainsi (voir [D], I.§4 ou II.§8), les 


valeurs Jo(A) = Reso(e't 7 dt) de la fonction de Bessel (resp. de sa dérivée J) (A)) 
s’interprétent dans cette théorie comme des quotients par 2i 2 de périodes de formes 
de premiére (resp. deuxiéme) espéce généralisées le long d’un cycle invariant sous 
la monodromie, et le théoréme de Siegel montre que—comme pour une courbe 
elliptique définie sur Q(A)—ces périodes sont linéairement indépendantes sur Q 
quand X est un nombre algébrique. Leur transcendance, et celle de leurs autres 
périodes, ne sont pas encore connues. 
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Abstract Let G = SO3(C), = SO3(Z[i]), K = SO(3), and let X be the locally 
symmetric space '\G/K. In this paper, we present a relationship between the 
heat kernel on SL3(C) and SO3(C). We write down explicit equations defining a 
fundamental domain for the action of ” on G/K. The fundamental domain is well 
adapted for studying the theory of /’-invariant functions on G/K. We write down 
equations defining a fundamental domain for the subgroup Iz = SO(2, 1)z of I" 
acting on the symmetric space Gr/ Kp, where Gp is the split real form SO(2, 1) 
of G and Kg is its maximal compact subgroup SO(2). We formulate a simple 
geometric relation between the fundamental domains of J” and Iz so described. 
Both the formula for the heat kernel and the fundamental domains are designed 
to aid in a detailed study of the spectral theory of X and the embedded subspace 
Xp = 'z\Gpr/Kr. 
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General introduction 


While Serge Lang and Jay Jorgenson were working towards a derivation of a 
theta relation and the Selberg zeta function associated to SL, (Z[i])\SL,(C)/SU(”) 
from the Selberg trace formula applied to the heat kernel (see the books [20], 
[18], [17], and [19]) they envisioned a similar “ladder” of trace formulas and zeta 
functions associated to the groups SO, (Z[i])\SO,,(C)/SO, (R) (see the introduction 
to [19]). They suggested to us that we attempt to complete that project, starting with 
the base case of SO3(Z[i])\SO3(C)/SO3(R). This paper consists of two distinct 
but interrelated parts of this project that have been completed to date. Section 
1 constructs the heat gaussian (from which the heat kernel may be derived in 
an entirely standard way, as on p. 48 of [19]) on SO3(C). In analogy with the 
I'\SL2(C)/SU(2) project, this section has the same purpose (though it follows a 
different method) as §2.5 of [19]. Section 2 constructs a fundamental domain for 
the action of the discrete group. In particular, it investigates the group structure 
and a fundamental domain for SO3(Z|i]) acting on the symmetric space. Also, 
considered in this section is the “real form” SO(2, 1)z\SO(2, 1)/SO3(R) (shown 
to be a hyperbolic surface with one cusp), which can be considered the base case 
of another ladder SO(p, g)z\SO(p, g)/SO(p + q) of locally symmetric spaces. 
This section corresponds in function to Chapter VI and parts of Chapter IX of [19], 
although again the method is different, relying directly on an explicit isomorphism 
of 7” = SO(3, Z[i]) with a particular Kleinian group (described in §2.1) rather than a 
direct construction. For much more extensive comments concerning the motivation, 
context, methods, and possible continuation of the work towards the theta relation, 
see the introductions preceding each of the two parts. 


Acknowledgements We would like to thank the editors for providing this opportunity to honor 
Serge’s memory and legacy, and to thank Jay Jorgenson in particular with whom we spent much 
time thinking about the ideas related to this paper. The first author gratefully acknowledges the 
financial support of the Center for Advanced Studies in Mathematics at Ben-Gurion University, 
which made this collaboration possible. 


1 Obtaining the heat kernel by integration over 
a normal bundle 


We take as our starting point knowledge of the heat gaussian via explicit formulas, 
on SL,(C)/SU(”), for which see [20] in general and [19], §2.5, for the special 
case of n = 2. Our viewpoint will be that one can obtain the heat kernel of the 
noncompact symmetric space SO, (C)/SO(n) by embedding SO,,(C) in the larger 
group SL,,(C) and integrating the heat gaussian on SL, (C)/SU(”) along certain 
normal fibers. 

Consider the following general setup. Let G; C G be noncompact real reductive 
Lie groups. It is possible to choose a maximal compact subgroup K of G so that 
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K, := G, 1 K is a maximal compact subgroup of G. In our setup we also assume 
there exists another, not necessarily reductive, group H C G such that G has a 
smooth decomposition G = HG,K. This then yields a product decomposition on 
the level of symmetric spaces G/K ~ Y x G,/K,, where Y is some quotient of 
H. Projecting onto the second component, one obtains a projection z : G/K —> 
G,/K. Viewing the heat kernel as a probability measure jz, on G/ K, the question is 
whether the push-forward of jz; under zr is equal to the heat kernel on G; / Ky. If one 
prefers to think of the heat kernel (or more precisely, heat gaussian) as a family of 
functions h;, then the push-forward becomes “integrating over the normals.” More 
precisely, define the function h? on G,/K, by 


n!(g1) = [ h,(hgi)dh. (1) 


The question is under what conditions on G, G, ish” the heat gaussian on G}. 

There are two main classes of examples where one does in fact obtain the heat 
kernel on G; in this fashion. The specific example SO3(C) C SL3(C) in this paper 
does not belong to either class and seems to be a low-dimensional coincidence. See 
comments at the end of the paper for further discussion. 

The simplest class is that in which G; is a Levi component of a parabolic 
subgroup of G and H is the nilradical of the parabolic. The push-forward is realized 
as an integral over the orbits of the nilpotent group H that are perpendicular to 
G,/K. Note that the case of (2n — 1)-dimensional hyperbolic space sitting inside 
2n-dimensional hyperbolic space has a similar, though distinct, setup. 

The second main class is what Flensted-Jensen worked on, where G; is a real 
form of G and H = K the complexification of a maximal compact subgroup of 
G,. Here the push-forward becomes integrals over spaces traced by the H -orbits of 
points in G,/K,. Except for the H-orbit through eK, which is isomorphic to the 
symmetric space H/K,, these H-orbits are not composed of geodesics. They are 
rather built of paths that are perpendicular to G,/K, and that maintain a constant 
distance from symmetric space which is the H-orbit through the identity. In this 
case there is a subtlety as to how the above integral describes the heat Gaussian: the 
value of the heat Gaussian at gj g{ € G,/ Kj is actually determined by the H -orbit 
through the coset determined by g}. 

The case of SO3(C) C SL3(C) examined here is structurally very similar to 
Flensted-Jensen’s case. The decomposition he uses, G = HG ,K, depends only on 
G, being the fixed-point set of an involution of G, and not necessarily a real form. 
This naturally led to the hope of expanding Flensted-Jensen’s results to encompass 
any G; C G which is the fixed point set of an involution. Unfortunately, it does 
not seem to work in general, with the present case SO3(C) C SL3(C) being a low- 
dimensional exception. It is noteworthy that in this case, it is the value of the heat 
Gaussian at (gig})* € G,/K;, that is determined by the H-orbit through the coset 
determined by g. For further discussion of this phenomenon, see comments at the 
end of the paper. 
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Finally, we mention that in both of the above cases, there is also a functorial 
action in the other direction: the pull-back under z takes eigenfunctions of the 
invariant differential operators on G,/K;, to eigenfunctions on G/K. In the case 
that G; comes from a parabolic subgroup this corresponds to parabolic induction 
of representations. When G;, is a real form of G there seems to be some kind of 
base-change principle at work. It is not yet clear whether the new example here has 
some similar interpretation, perhaps being related to the symmetric square lifting. 

A closed-form formula for the heat kernel on SO3(C) can already be deduced 
from the formulas of Gangolli for the heat kernel on a complex Lie group: cf. [12]. 
The point of this paper is to show how a relationship between the heat kernel on 
classical groups may arise naturally through the push-forward of the heat kernel 
under projection along orbits of the linear group and to place this fact in the context 
of similar phenomena for different projections. Though Serge’s original hope of 
obtaining all heat kernels of symmetric spaces as the “homomorphic image” of the 
heat kernel of SL,,(C) seems to be too much to ask for, it would be nice to have a 
more fundamental understanding of precisely what specific conditions are needed 
on a double coset decomposition G = HG, K to ensure that the resulting projection 
does map the heat kernel of G to the heat kernel of Gy. 


1.1 Setup and notation 


Let G = SL3(C) and let 6(¢g) = (g*)~! define the standard Cartan involution. 
Then K = G® = SU(3) is a maximal compact subgroup of G. We let A and N 
be the subgroups of positive real diagonal matrices and upper triangular unipotent 
matrices respectively. As is well known, G has the Iwasawa decomposition G = 
NAK, which is unique, and the Cartan decomposition G = K AK, in which the A 
component is unique up to permutation of its entries. 

Let J be the symmetric matrix given by antidiag(1,—1,1). Then J defines a 
conjugation of G by g° = JgJ~!, where bar denotes complex conjugation of each 
entry of g. Note that J = J~!. We then obtain a third involution, t, by composing 
o and 6: g° = g?? = J'g !J. The group fixed by t, G; = G*, is thena J 
conjugate of the standard embedding of SO3(C). The advantage of working with 
this conjugate is that its Iwasawa decomposition is compatible with the standard 
Iwasawa coordinates of G. On the Lie algebra level we have t(X) = —J' XJ and 
g, is the set of all matrices of the form 


ab 0 
dO b 
0 d—a 


G, has Iwasawa and Cartan decompositions just like G, where Nj, A;, and K, are 
obtained by intersecting N, A, and K with G,. 
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Let H = G*°, which is isomorphic to SL3(R). There is a generalized Cartan 
decomposition G = HA,K in which the A; component is unique up to the action 
of W,, the Weyl group of SO3(C). This result is due to Flensted-Jensen [9], though 
the main content can already be found in Mostow [22]. Later, we will use this 
decomposition to compare the Casimir operators of G and G;. First we need to 
decompose g and g; under the bracket action of ay. 

It is immediate that a; acts on ny, the Lie algebra of Nj, by scalar multiplication. 
If Y; € a, is the matrix diag(t,0,—1), then for any element Z € nj, [Y,Z] = 
a\(Y)Z = tZ. Furthermore, the corresponding eigenspace is of real dimension 2. 

Decomposing all of n, the Lie algebra of N, under a; is only slightly more 
complicated. The eigencharacter a; corresponds to an eigenspace of real dimension 
4, and the eigencharacter ~. = 2a, has eigenspace of real dimension 2. Observe 
that of the a; eigenspaces, two are fixed by tT, and two have eigenvalue —1 under Tt. 
For @2, both eigenspaces have eigenvalue —1 under tT. 


1.2 Integral formulas 


For the main result, we will need to fix a normalization of measures and obtain 
an integral formula for the generalized Cartan decomposition above. We fix the 
invariant measure to be | on all nondiscrete compact groups and spaces. Let B (resp. 
B,) be a positive multiple of the Killing form of g (resp. gi). Then Blg, = c’ Bi. 
The form B (resp. B,) is positive definite on a (resp. a,) and we use the exponential 
map to push forward the Euclidean measure on a (resp. a,) to a Haar measure da 
on A (resp. da; on A,). We now define the Haar measure dg on G such that 


[ teds= fof freaky s@ak daa 


for any continuous f with compact support. Here, J(a) is the standard Jacobian 
of the Cartan decomposition G = KAK, cf. [15] for example. We normalize the 
measure on G; in the same way and note that in this case, the Jacobian J; is simply 


Ji(a1) = sinh? a, (log a;). 


The exponent 2 comes from the fact that the a@,-eigenspace has dimension 2. 

Fix an arbitrary Haar measure dh on H. Flensted-Jensen computed the Jacobian 
for the generalized Cartan decomposition above; cf. [10] Theorem 2.6. In our 
specific case, the relevant integral formula becomes the following. 


Proposition 1.1. Given the above normalization of measures, there exist a function 
J’ and a constant c such that for all continuous functions on G with compact 
support we have: 


1. de F(g)dg = te ti de ftha,k) J! (a,)dkda,dh. 
2.0 ai) Seda): 
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Proof. The first statement is standard abstract nonsense following from the smooth 
decomposition G = HA, K. That the Jacobian J’ does not depend on the H and K 
coordinates is a simple consequence of the G-bi-invariance of the Haar measure dg. 
The specific form of J’ is immediate from [10], Theorem 2.6, the fact that az = 2a, 
and basic hyperbolic trigonometric identities. Note that the constant c will depend 
on the specific measure dh that was selected. 


1.3 Differential operators and the heat kernels 


With measures fixed, all that is needed to define the heat kernel is the second-order 
Casimir operator. Given an element Z € g, define the left-invariant differential 
operator 


Z(f)(g) = a: f (ge’”)|r=0, 


where f is any smooth function on G. The form B introduced above is nondegen- 
erate on g and positive definite on a, so we can use it to identify a and its dual. In 
particular, we define Y, to be the unique element in a such that B(Y,, X) = a(X) 
for every X € a. We remind the reader that the second-order Casimir operator of 
SL3(C), @, is defined by 


w= > ZF, (2) 


where {Z;} is any basis of g and {Z/} is the dual basis under B. The second-order 
Casimir of SO3(C), @1, is obtained by inserting subscripts | everywhere in (2), 
taking care to use the form B, in calculating the dual basis. Note that there is some 
positive constant c’ such that B|,, = c’ By. 

For our main result, we will compare the direct images of w and @, to Aj. 
Suppose we are given a function f; on A, that is invariant under the action of 
W,. Using the generalized Cartan decomposition G = HA,K, we extend f; to 
a function on G by defining f\(ha,k) = f\(a1). Given a differential operator D 
which is right K- and left H-invariant, the function D fi is again a W, invariant 
function on A,. Given an H -left-invariant, K -right-invariant differential operator on 
G, we can now project it to a differential operator on A;. Given such a differential 
operator D, we define its A, projection D by D f\(a) = Df, (a,), which is then a 
differential operator on A,; cf. [25] for a more formal treatment. 

Applying this to @, it is a result of van den Ban and Schlichtkrull [25] that the 
direct image of w to Aj is 


1 s 
© = — (wa, + 2(cotha, + tanha)Y, + 2(tanha) Yo.) . (3) 
c 
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where wa, is the Casimir (i.e., Laplacian) of A; determined by the form B, and 
the Y,, are the vectors dual to a; under B, fori = 1,2. Since cothx + tanhx = 
2 coth2x and 2Y,, = Yy,, we can simplify to obtain 


1 z 
® = (a, + 8 coth 4a) Yo). (4) 
Cc 
We remind the reader of the standard result, cf. [15], that the direct image of a, to 
A, using the G; = K,A,Kj, decomposition (i.e., the radial component of @ ) is 
given by 7 
@| = @4, +2cotha;Yo,. (5) 


We can now state the main theorem. 


Theorem 1.2. Let f be a K,-bi-invariant function on G; and F be an H_-left- 
invariant, K -right-invariant function on G such that for a, € Aj, 


F(a) = f (aj). 


Then i 
(oF (a) = [ (oi f (ai). 


Proof. This is essentially freshman calculus. It is easier to work on ay, with 
a, = e"! and viewing F and f as functions on a;. Under this transformation, wa, 
becomes the standard Laplacian on a), which we denote by A,,, and a becomes 
the standard directional derivative in the Y,, direction. 

As a mental aid for the computation, we write F = f o 4, where “4” means 
multiply the variable by 4. Evaluating (wF)(Y1) term by term, we get 


(Aa, F)(M1) = (Aa f 0 4)(%1) = 16(Aa, f)(4N), (6) 


8 coth 4a) (¥1)(dy,, F)(%) = 32 coth 4a (Y1)(y,, F)(4%). (7) 


Combining (4), (6) and (7) and using the fact that 4a, (VY) = a) (4Y), the theorem 
is immediate. 


This relationship between Casimir operators on G and G, leads immediately to 
a relationship between the solutions of their respective heat equations. We remind 
the reader that the heat Gaussian h,, t > 0 on G is the unique family of positive 
K-bi-invariant functions such that wh, = 0,h;, with total integral 1 for all t, and 
such that h, approaches the Dirac distribution as t approaches zero. 


Theorem 1.3. Let h; be the heat Gaussian of G. Then the restriction to A, of the 
heat Gaussian of G, is given by 


4 
hi, (a}) = | hy. (ha,)dh. 
CJH 16 
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Proof. We first verify that h;, satisfies the heat equation on G. Let 


“2 / hiv (hg)dh 
H 16 


denote the left H-average of h,.. considered as a function on all of G. Then 
16 


/ 
inde GwesS / catiesdns — / anaes 
H 16 16 H 16 


/ 


Cc 
i 


[ be thaydh = (om, ya’) 
H 16 


Let F and f be as in Theorem 1.2. To see that the total integral of hy, is equal to 
1 for all t, we use the integration formulas associated to the Cartan and generalized 
Cartan decompositions. Using the fact that a2 = 2a, we see that 


[ F(g)dg 


II 


c / F(e™) sinh? a (4Y;)dY, = ; / f(e") sinh? 01 (Yd Y1 
ay ay 


an f(giddgi. 


This proves the theorem. Oo 


We have deliberately left the constants c and c’ undetermined so that one can 
see directly how their choice of normalizations will affect the relationship between 
heat gaussians. However, the exponent of 4 on the left-hand side of Theorem 1.3 is 
structural and cannot be modified. Note that the number 4 also makes an appearance 
in Proposition 1.1 that is critical to the above proof. 

The number 4 appears in another context that we have not yet mentioned. Let p 
(resp. 1) be the sum of all the characters occurring in the decomposition of a (resp. 
a,) acting on n (resp. n,). Then 


Pla, = 4/1. 


Though it is not clear exactly how these two coefficients are related, it is more than 
coincidence. Indeed, in the case of Flensted-Jensen’s work, where G is a real form 
of G, the exponent of a; occurring in the theorem analogous to Theorem 1.3 above 
(cf. [9] Theorem 6.1) is 2. There is a similar relationship between p’s given by 


Pla = 2p1. 


Finally, in the case that G, arises as the semisimple part of a parabolic subgroup of 
G, no exponent is needed on a, and we have 


Pla; = PI: 
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It was stated in the introduction that it is not possible to extend Flensted-Jensen’s 
results to the case that G; is the fixed-point set of an arbitrary involution. The next- 
simplest case to check is that of G; = SOs5(C) C SL5(C) = G. Though there is a 
nice decomposition G = HG, K, the associated projection of differential operators 
does not yield a relationship between Casimir’s, similar to that of Theorem 1.2 so 
one cannot even get started in relating the heat kernels. We also note that in this case, 
the restriction of p to a; is not a multiple of p;. One encounters similar obstructions 
for other cases. 


2 Fundamental Domains for SO3(Z[i])\SO3(C)/SO(3), 
and for SO(2, 1)z\SO(2, 1)/SO(2) 


The first author has undertaken, in Chapter | of [6], a generalization of the classical 
theory of Ford fundamental domains (see §2.2 of [16]) for Fuchsian groups to 
a wide class of group actions including, in particular, [, = SL,(Z[i]) acting 
on G, = SL,(C)/SU() and GL(, Z) acting on GL(n, R)/SO(n). The present 
section carries out this theory in a very explicit manner in one concrete, low- 
dimensional case. This endeavor lies sufficiently outside the mainstream of the 
modern theory of automorphic forms that we find it necessary to preface the 
exposition with some remarks concerning the envisioned uses of such an explicit 
fundamental domain. 

In the case of GL(n, Z) acting on GL(n, R)/SO(n), the fundamental domains 
obtained in [6] coincide with the F;, studied by D. Grenier in [13] and [14] (allowing 
for the isomorphism of the symmetric space G/K with the quadratic model P). 
For this reason, we adopt the terminology Grenier domains for the generalized 
Ford domains. A major theme of Grenier’s work in these articles is that the F, 
for different n are best considered as part of an inductive scheme, since F,, for 
m <n appear both in the definition of F/, and in his construction of the Satake 
compactifications of the locally symmetric space GL(n, Z)\GL(n, R)/SO(n). The 
base case of Grenier’s inductive scheme is (ignoring the center of GL(n, R)) 
provided by Dirichlet’s classical fundamental domain for SL2(Z) acting on the 
upper half-plane. The results of this paper may be viewed as providing the base case 
for an inductive scheme of the same type corresponding to the sequence of locally 
symmetric spaces in (93), below. Note that the base case for this “orthogonal” 
sequence is considerably more complicated than the base case for Grenier’s “general 
linear” sequence. 

We take advantage of the well-known isomorphism 


SL»(C)/{+1} —> SO;(C), 


specified at the beginning of §2.1, to identify the lattice SO3(Z]i]) with a group of 
fractional linear transformations acting on H*. The concrete outcome of the present 
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section is to state explicitly what this arithmetic subgroup is in explicit matrix 
terms (Proposition 2.8) and give an appropriate fundamental domain, .¥ (Y), for 
the natural action on hyperbolic 3-space (Proposition 2.19). The reader who is solely 
interested in this result may wish simply to read 2.3, which is self-contained, and see 
the interactive graphics representation of the fundamental domain .F (¥Y), available 
on the internet at [3]. 

Following J.S. Friedman, A.B. Venkov, and A. Selberg, one defines the three- 
dimensional vector Selberg zeta function associated to a Kleinian group I" and 
a unitary representation xy of by 


CO 


Zry(s) =|] [ [deta — x) No(y)°), for Res > 0. (1) 
{y} k=0 


In (1), called an “Euler project” expression, {y} ranges over I"-conjugacy classes 
of primitive hyperbolic elements in I” and No(y) denotes the length of the closed 
geodesic on ’\G/K corresponding to y. The meromorphic continuation of Zr, 
(or, more precisely, of its logarithmic derivative Z’/Z) to the entire complex domain 
is closely related to an explicit form of the Selberg trace formula, worked out, for 
example, in [11] in parallel to [8]. Itis of obvious interest to obtain relations between 
the Zr, of the members of a pair of lattices (I, I’), where I” and I’ are related 
in various ways. For example, in the case of (I’, ”’) a pair of Fuchsian groups, with 
I’ CIF and [I : I’’] < 00 (with Z;,, defined similarly for Fuchsian groups), [26] 
gave a formula which is loosely called a “factorization formula,’ because in the case 
I’ normal in I’, it specializes to a bona fide factorization of Zr’, as the product 
of the Zr,,, where y; ranges over the irreducible direct summands of Ind}, X. 
In [5], we extended the factorization formula, now called “Artin formalism’, to 
arbitrary pairs (I, 1’) of commensurable Kleinian groups, dropping the assumption 
of normality. It is clear from the definition (1) that in order to apply the Artin 
formalism to particular pairs, such as (c~'(SO3(Z[i])), PSL2(Z[i])), one needs to 
develop concrete understanding of the relations between the hyperbolic conjugacy 
classes of the groups in question. Proposition 2.8, below, lays the foundations for 
that study. In §7 of [4] we derive formulas for the Selberg zeta function of SO3(Z[i]) 
(resp. SO(2, 1)z) in terms of PSL2(Z]i]) (resp. PSL2(Z)). 

In §2.6, we discuss the application of fundamental domains to the study of a more 
general class of spectral zeta functions. 

Based on the SL, /GL, examples in the literature, one can speculate on future 
applications of exact fundamental domains to traditional problems in number theory. 
Some diverse examples of applications of Grenier’s domain for GL,,(Z), acting on 
the space of positive definite real matrices P,,, include the proof in [7] of a bound 
on the first nontrivial eigenvalue of the Laplacian for the case n = 3 and the inves- 
tigations of [23] into the minima of Epstein’s zeta function. D. Hejhal’s pioneering 
work in the 1970s on explicit computer calculations of eigenvalues of the Laplacian 
acting on Maass forms relied on an explicit reduction algorithm for identifying an 
element of the quotient SL2(Z)\H? with a point in the Dirichlet domain. As part 
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of ongoing renaissance of this field of explicit computer computations related to 
automorphic forms (for a broad survey of which see the speakers and talks of 
the 2009 Séminaire de Mathématiques Supérieures, “Automorphic Forms and L- 
Functions: Computational aspects”, [24]), several generalizations of this explicit 
reduction algorithm have been developed and more will be needed. At the moment 
the subject of explicit computation of automorphic forms for groups of higher-rank 
remains in its infancy. It seems likely that as the detailed study of automorphic 
functions on quotients of SO,,(C) and its real forms becomes more developed, the 
exact fundamental domains, which the present paper specifies in the “base case” 
n = 2, will play a large role in investigating certain zeta functions associated to 
these arithmetic quotients. 

We mention the relation of Propositions 2.8, 2.19, and 2.27, below, to some 
results already in the literature. First, M. Babillot, in Lemma 3.2 of [1], constructs a 
fundamental domain for SO(2, 1)z acting naturally on the hyperboloid of one sheet. 
The method there bypasses results like Propositions 2.8 and 2.19 by embedding 
SO(2, 1)z as a subgroup of a triangle group of index two. The fundamental domain 
so obtained is used to give a constructive proof that SO(2,1)z acts with finite 
covolume, so that a general theorem can be applied to solve a lattice-point counting 
problem. Also, there is a well-developed theory of splines, which are models for 
the arithmetic quotients of Q-rank-one groups, in a way different from, but related 
to, (Grenier) fundamental domains. For a recent treatment with a general existence 
theorem and references, see [27]. It would be interesting (and possibly useful for 
cohomology calculations of the sort undertaken in [28]) to determine precisely the 
relation of “duality” that seems to exist between the splines and Grenier fundamental 
domains. However, this is more relevant to higher rank, and therefore, belongs more 
to the continuation of the study undertaken in [6] than to the study at hand. 

The verifications of all the principal propositions of the present paper are 
elementary, though lengthy, and they are not needed for the envisioned applications 
of the results. Accordingly, many details of proofs are omitted and the interested 
reader is referred to the electronically archived preprint [2] for them. 


2.1 Representation of SO3(Z[i]) as a lattice in SL2(C) 


We begin by establishing some basic notational conventions. Let 1 be a positive 
integer and o a ring. We will use Mat,(o) to denote the set of all m x n square 
matrices with coefficients in 0. We reserve use the Greek letters w, and so on, for the 
elements of Mat, (0), and the roman letters a, b,c, d, and so on, for the entries of the 
matrices. We will denote scalar mutliplication on Mat, (0) by simple juxtaposition. 
Thus, if o = Z[i], € € Z[i], and a € Mat»(Z]i]), then 


a= ab implies fa = fa £b 
“led , ~\ be ed)’ 
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The letters p,q, r,s will be reserved to denote a quadruple of elements of 0 such that 
ps —rq = 1. In what follows, we normally have 0 = Z/i] whenever a is written 
with entries p through s. Therefore, 


a= (’ ‘) € SL»(Zii]), 


Ss 


unless stated otherwise. 

We will denote a conjugation action of a group on a space V by cy when the 
context makes clear what this action is. For example, if H is a linear Lie group and 
h the Lie algebra of H, then we have 


ey(h)X =hXh"', forallhe H, X €b. 


Note that the morphism c,(/) is the image under the Lie functor of the usual 
conjugation e7(h) on the group level. Using SL(V) to denote the group of 
unimodular transformations of a vector space V, it is easy to see that 


cy : H — SL(b) is a Lie group morphism. (2) 


Henceforth, whenever H is a group acting on a Lie algebra h by conjugation, we 
will omit the subscript 5. Thus, we define 


C= Cp 


when we are in the situation of (2). 

Except in §2.2, we will use the notation G = SO3(C), = SO3(Z[i]). We use B 
to denote the half-trace form on sl,(C), the Lie algebra of traceless 2 x 2 matrices. 
That is, 


B(X,Y) = STHY). 


We use the notation 6’ = {X{, X3, Y’} for the “standard” basis of sl,(C), where 


xX, = ¢ He xX,= ({ ae and Y'= (( “) (3) 


The following properties of B are verified either immediately from the definition 
or by straightforward calculations. 


B1 B is nondegenerate. 
B2 Setting 


X,=X4+X5, X, =i(X/- XJ), and Y=Y’, (4) 


we obtain an orthonormal basis B = {X,, X2, Y} with respect to the bilinear 
form B. 


Some aspects of harmonic analysis on locally symmetric spaces 81 


B3 B is invariant under the conjugation action of SL2(C), meaning that 
B(X,Y) = B(e(g)Z,e(g)W), forall Z,W € sh(C), g € SL2(C). 
By B3, c is a morphism of SL2(C) into G. The content of part (a) of 
Proposition 2.1 below is that the morphism c just described is an epimorphism. 
As a consequence of B1 and B2, we have that 

B(xt X1 +5 Xo+-y Y,x7 Xi tx3XotyV) = xi xP+xgxptyly’, xy EC. 
(5) 
For any bilinear form B on a vector space V, we use O(B) to denote the group 


of linear transformations of V preserving B, and we use SO(B) to denote the 
unimodular subgroup of O(B). If B is as in (5), then the isomorphism 


SO(B) = G (6) 


induced by the identification of the vector space sl,(C) with C(X1, X2, Y) puts a 
system of coordinates on G. Part (b) of Proposition 2.1, below, will describe the 
epimorphism c : SL2(C) — G in terms of these coordinates. 


Proposition 2.1. With G, c as above, we have 


(a) The map ¢ induces an isomorphism 


SLi) se 


of Lie groups. 

(b) Relative to the standard coordinates on SLo(C) and the coordinates on G 
induced from the orthonormal basis B of sb(C), as defined in (4), the 
epimorphism ¢ : SL2(C) > G has the following coordinate expression: 


2.24.92 72  j(q2—p2 4. 42—q2 
a*—c*+d*—h*?  i(a*—c*+b*—d~) cd —ab 


2 
b . 3 
e((¢ ) — a a’—c?)  g2+c? ere i(ab +cd) . (7) 
-ac+bd i(ac+bd) ad+be 


We establish some further notational conventions regarding conjugation mappings. 
Whenever a matrix group H has a conjugation action cy on a finite-dimensional 
vector space V over a field F’, each basis 6 of V naturally induces a morphism 


cy.p: H + GLy(F), where N = dimV. (8) 


Let B, B’ be two bases of V. Write a?’ for the change-of-basis matrix from B to 
B’. That is, if 8, 6’ are written as N-entry row vectors, then 


Bab’ = B’, (9) 
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Then elementary linear algebra tells us that 


y = , 
Cy.p = CGLy(F) (Cis ) evs = CGLy(F) (a! =) Cy,p’. (10) 


Assuming that cy is injective, and writing cP for the left inverse of cy, we calculate 
from (10) that 


cy, pey}y € Aut(GLy (F)) is given by eoty(r) (a*?") (11) 


In keeping with the practice established after (2), we will omit the subscript h when 
H is a Lie group acting on its Lie algebra by conjugation. Thus, for any basis 6 
of b, 
Cg := Ch. 

Generally speaking, whenever we fix a single basis 6 for § we will blur the 
distinction between ¢ and cg. For example, in this paper, whenever H = SL2(C) 
and V = Lie(#), we will write c to denote both the “abstract” morphism ¢ of 
H into Aut(V) and the linear morphism eg of H into GL3(C), where B is the 
orthonormal basis for Lie(H) defined in (4). Whenever the linear morphism into 
GL3(C) is induced by a basis B’ # £, the notation cg will be used. 

We now turn our attention to the description of the inverse image c~!(I") as 
a subset of SLo(C)/{+/} with respect to the standard coordinates of SL2(C). 
According to Proposition 2.1, this amounts to describing the quadruples 


(a,b,c,d)€ C*, with ad — bc = 1, and the entries of the right side 
of (7) integers. (12) 


Describing the quadruples meeting conditions (12) will be the subject of the 
remainder of this section, culminating in Proposition 2.8. 


Conventions regarding multiplicative structure of Z[i]. Before stating the propo- 
sition, we establish certain conventions we will use when dealing with the multi- 
plicative properties of the Euclidean ring Z|i]. First, it is well known that Z[i] is a 
Euclidean, hence principal, ring. That Z[i] is principal means that all ideals 7% of 
Z{i] are generated by a single element m € Z[i], so that every 7 is of the form (m). 
However, there is an unavoidable ambiguity in the choice of generators caused by 
the presence in Z[i] of four units, i’, for 7 € {0,..., 3}, in Z[i]. We will adopt the 
following convention to sidestep the ambiguity caused by the group of units. 


Definition 2.2. We refer to the following subset of C* as the standard subset: 


{z € CX | Re(z) > 0, Im(z) = 0}. (13) 
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That is, the standard subset of C* is the union of the interior of the first quadrant 
and the positive real axis. An element of Z[i] in the standard subset will be referred 
to as a standard Gaussian integer, or more simply as a standard integer when the 
context is clear. 


Because of the units in Z[i], each nonzero ideal .¥ of Z[i] has precisely one 
generator which is a standard integer. Henceforth, we refer to the generator of 7 
which is a standard integer as the standard generator of .%. Unless otherwise 
stated, whenever we write .% = (m), to indicate the ideal 4% generated by an 
m € Zji], it will be understood that m is standard. Conversely, whenever we write 
an ideal .¥ in the form (mm), it will be understood that m is the standard generator 
of .%. Thus, for example, since (1 — i) = i#3(1 + i) with 1 + i standard, we write 
Y =: (1 —i)Z[i], defined as the ideal of Gaussian integers divisible by 1 — i, in the 
form % = (1 +i). 

Similar comments apply to Gaussian primes, factorization, and greatest common 
divisor in Z[i]. By a “prime in Z[i]” we will always mean a standard prime. By 
“prime factorization” in Z[i] we will always mean factorization into a product of 
standard primes, multiplied by the appropriate unit factor. Note that the convention 
regarding standard primes uniquely determines the unit factor in a prime factoriza- 
tion. For example, since 

2=(1+3) 


and (1 + i)? is standard, the above expression is the standard factorization of the 
Gaussian integer 2, and i is uniquely determined as the standard unit factor in the 
prime factorization of 2 € Zi]. 

By convention, unless stated otherwise, the “trivial ideal” Z[i] will be understood 
to belong to the set of ideals of Z[i]. The standard generator of the trivial ideal Z[i] 
is, of course, 1. 

To facilitate the statement of Proposition 2.8, we estblish the following conven- 
tions. First, we use wg to denote the unique primitive eighth root of unity in the 
standard set of C*. Observe that 


3 
om = 20 +3) and w2 =i. (14) 


The SL2(Z[i])-space MY. 


Definition 2.3. For N € Ziil, M) will denote the subset of Mat2(Z[i]) consisting 
of the elements with determinant N. Since the group SL(Z[i]) acts on M¥’ by 
multiplication on the left, M3’ is an SL2(Z[i])-space. 


It is not difficult to see that the action of SL(Z[i]) on MY fails to be transitive, 
so MY is not an SL2(Z[i])-homogeneous space. The purpose of the subsequent 
definitions and results is to give a description of the orbit structure of the SL» (Z[i])- 
space MY’, 

Let 


§2, := a fixed set of representatives of Z[i]/(y), for all y € Zi. (15) 
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It is clear that for each y € Z[i], there exists a number of possible choices for 92y. 
For the general result, Proposition 2.6, below, the choice of 2, does not matter, 
and we leave it unspecified. However, in the specific applications of Proposition 
2.6, where y is always of the form y = (1 + i)” for a positive integer, it will be 
essential to give an (2, explicitly, which we now do. 

So letn € N,n > 1. In the definition of 2(;+i),, we use the “ceiling” notation, 
defined by 

[q| = smallest integer > g, forg € Q. 


Now set 
Qazi = \r +siwithr,s€Z,0<r<2/2!|,0<s< ara . (16) 


The definition is justified by Lemma 2.4, below. 
Lemma 2.4. Forn > 1 an integer, let 2(14i be defined as (16). Then 


Qa+i is a complete set of representatives of Z[i|/(( + i)")) for all n. 


Definition 2.5. Let N € Z[i] be fixed, and for each y € Z[i] let (2, be as in (15). 
Define the matrix a\(m, x) € M¥ as follows: 


aN(m, x) = [ i) form € Z[i], m|N, x € Qn. (17) 


m 


It is trivial to verify that wN(m, x) as given by (17) indeed has determinant N, i.e., 
aN(m,x) € M} . The point of Definition 2.5 is given by the following proposition. 


Proposition 2.6. For N € Z[i] — {0}, let M be the SL2(Z[i])-space of matrices 
with entries in Z[i] and determinant N. Define the matrices aN(m, x) as in (17). 
Then 


My = | 'S SLo(Z|i])a® (m, x), (18) 
mE€Z{i]| m|N, xEQ N 
7 Standard ne 


and (18) gives the decomposition of the SL2(Z{i])-space M} into distinct SL» (Z{i])- 
orbits. 


We now make some comments concerning the significance of Proposition 2.6. 
First, a statement equivalent to Proposition 2.6 is that an arbitrary a € My has a 
uniquely determined product decomposition of the form 


N 
o= (: >) = (’ a & x). withm € 0, m|N, = standard, 


m 


x€2n, pr-—qs=1. (19) 


Some aspects of harmonic analysis on locally symmetric spaces 85 


The uniqueness is derived from Proposition 2.6 as follows. The second matrix in the 
product of (19) is uniquely determined by the matrix a because of the disjointness 
of the union in (18). The first matrix in the product appearing in (19) is therefore 
also uniquely determined. 

The second remark is that Proposition 2.6 may be thought of as the Gaussian- 
integer version of the decomposition of elements of Mat2(Z) of fixed determinant 
N, sometimes known as the Hecke decomposition. Occasionally we refer to (19) 
as the Gaussian Hecke decomposition, to distinguish it from this classical Hecke 
decomposition in the context of the rational integers. 

The proof is the same as that of the classical decomposition except for some care 
that has to be taken because of the presence of additional units in Z[i]. For the 
classical Hecke decomposition, see page 110, §VII.4, of [21], which is the source of 
our notation for the Gaussian version. 


Statement of the Main Result of §2.1. Let & be an arbitrary subset of SL2(Zi]). 
Suppose, at first, that F is actually a subgroup of SL2(Zfi]). Since SL2(Zfi])aN (m, x) 
is an SL2(Z[i])-space, it is also a &-space. For general subgroups &', however, the 
action of & on SL2(Z{i])aN(m, x) fails to be transitive, i.e., SL2(Z[i])aN (m, x) 
is not a &-homogeneous space. We will now describe the orbit structure of 
SL2(Z[i]J)aN(m, x) for a specific subgroup J. In order to make the description 
of the subgroup and some related subsets of SL2(Z[i]) easier, we introduce the 
epimorphism 
red} 4; : SLo(Zf[i]) > SL2(ZIi]/( + i) 


by inducing from the reduction map 


redi4i: Z[i] > Z[iJ/(1 +. 


That is, we “extend” red,+; from elements to matrices by setting 


teal, Pq = redj4;p red\+ig (20) 
rs redj4ir red,+445 
Since 214; = {0,1}, we may identify Z[i]/(1 + i) with {0, 1}. Similarly to the 


convention with p,q,r,s € Z[i], we use (p,g,7,5) to denote a quadruple of 
elements of Z[i]/(1 + i) such that 


ps—Fg=l. 


There are two elements of SL2(Z[i]/(1 + i)) of particular interest: 


io (( + i= & ») € SLi(ZfiJ/(1 +i). (21) 
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The notation in (21) is chosen to remind the reader that J = red,4;(/) and S = 
red,+;($), where 7, S are the standard generators of SL2(Z), as in §VI.1 of [19]. 
Since S = T, it is easy to see that {7, S} is a subgroup of SL2(Z[i]/(1 + i)). Now 
define 

By. = red), ,({7, S}). (22) 


Since red; +4; is a morphism, 5» is a subgroup of SL2(Z|i]). 
Also, using the epimorphism red) +;, we define the following subsets of SL2(Z|i]): 


a amsi({9) G2) 
=e): i)}): 


lend 


(The subscripts on the & of (22) and (23) are chosen in order to remind the 
reader of the column in which zeros appear in the matrices of red)4;(4).) Since 
SL2(Zfi]/(1 + i)) consists of the elements /, S' and the four elements appearing on 
the right-hand side of (23), and red; +; is an epimorphism, we have 


Q 


SL(Zii) = (J Sl) Sn. (24) 


Unlike 41, the subsets &, and & of SL2(Z|i]) are not subgroups. 
All three subsets & in (22) and (23), though, have a description of the following 
sort, which gives some insight into the reason for Sublemma 2.7, below. For fixed 


(1 1), 
(pq). (F 5) € 4 (10), ¢ C SLoZfil/d +i)”, 
01) 


we have 


c-msi(()-GQ)) 


For example, we obtain 47 by taking 


in (25). 

The reason for introducing the subsets & of (23) is that they allow us, in 
Sublemma 2.7 below, to describe precisely the orbit structure of the &)2-space 
SL2(Zfi])oN (mn, x). 
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Sublemma 2.7 Using the notation of (17) and (23), we have 


SL2(Zfi])a% (m, x) = 'S SaN(m, x). (26) 


Each of the three sets in the union (26) is closed under the action, by left- 
multiplication, of 2,2 on SL2(Zfi])aN (m, x) and equals precisely one 5,7-orbit in 
the space SL2(Zfi])aN (m, x). 


Proposition 2.8. Let ¢ be the morphism from SL2(C) onto G as in (7). Let T= 
SO3(Z|i]) be the group of integral points of G in the coordinatization of G induced 
by the isomorphism (6). Let the subsets 5, G2, &\2 of SLa(ZIi]) be as defined in 
(22) and (23). Let the matrices aN(m, x) be as in (17). Let wg € C be as in (14). 
Then we have 


1 3 1 +3 
—] fed 1°25 mi e145 se 
el(r)= |-J (Jpn (i, 0) | -} (u Sra (i »)). 
3,=0.1 wf €=0,1 wg (1 + i) 

(27) 


Remarks 


(a) We use Z[w]| to denote the ring generated over Z by ws. By (14) we have Z|i] C 
Zag] and Z[wg] = Zlag. i]. It follows from Proposition 2.8 that e!(") C 
SL2(C) is in fact a subset of SL3(Q(@)). More precisely, of the two parts of the 
right-hand side of (27), we have 


1 , 
— Ena (5,0) C SLa(Zfi,s}) for 8 € (0, 1}, (28) 
Ws 
while 
\.) 1} a gait gts if)) CSL, (<| wg il) for 6 € {0, 1}. 
ao (+9) — nw T+i 
(29) 


(b) One can easily verify that the set on the left-hand side of (28) is closed under 
multiplication, while the set on the left-hand side of (29) is not. More precisely, 
through a rather lengthy calculation, not included here, one verifies that 


of form (29) 
for (x, y) a pair of elements of the form of (29), xy is or (30) 


of form (28). 


with each possibility in (30) being realized for an appropriate pair (x, y). These 
calculations amount to a brute-force verification of the fact that the right-hand 
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side of (27) is closed under multiplication. But because I” is a group and c a 
morphism, this fact also follows from Proposition 2.8. 


The explicit representation of e~!(J”) in Proposition 2.8 allows us to read off 
certain group-theoretic facts relating c~!(I”) to SL2(Zfi]). In Lemma 2.9 below we 
use the notation 


[G : H] is the index of H in G, for any group G with subgroup H. 


Lemma 2.9. Let c!(I") be the subgroup of SL2(C) described above, given 
explicitly in matrix form in (27). All the other notation is also as in 
Proposition 2.8. 


(a) We have 
e' (7) N SLa(Zfi]) = Ep. 


(b) We have 
[e1W): Sy] =6, [SLo(Zfi]) : B12] = 3. (31) 


Explicitly, the six right cosets of 2 in e~'(I’) are the two cosets obtained by 
letting 6 range over {0, 1} in 


1 dns 1 i 0 
—& i,0)=—4 
we 12a (I, 0) we 12 & 1 


and the four cosets obtained by letting 6, € range over {0, 1} independently in 


I aw {10\ at eite se 1 o ce iv ) 
————_— 8 a i’. *) = ———24 aie wd 
w§(1 + i) 2(74) ob +iy Vito 2 +i 


2.2. Good Grenier fundamental domains for arithmetic groups 
T € Aut* (H’) 


We begin with the following definition, which is fundamental to everything that 
follows. 


Definition. Let X be a topological space. Suppose that I” is a group acting 
topologically on X, ie, I C Iso(X). A subset ¥ of X is called an exact 
fundamental domain for the action of [ on X if the following conditions are 
satisfied: 


FD 1. The J-translates of ¥Y cover X, i.e., 


X=TF. 
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FD 2. Distinct -translates of ¥ intersect only on their boundaries, i.e., 
vV1,¥2 EL, yi A yo implies yp» F Nyro F C yidF, y20F. 


Henceforth, we will drop the word exact and refer to such an ¥ simply as a 
fundamental domain. 


For the current section, §2.2, only, G, instead of denoting SO3(C), will denote 
SL2(C). Likewise, instead of denoting SO3(Z|i]) or e~!(SO3(Z[i])), I” will denote 
an arbitrary subgroup of SL2(C) satisfying certain conditions to be given below. 
The main examples to keep in mind are first, 7 = SL2(Z), the integer subgroup of 
SL2(C), and second, ” = c~!(SO3(ZIi])), the inverse image of the integer subgroup 
of SO3(C), described explicitly as a group of fractional linear transformations in 
Proposition 2.8. 


Iwasawa decomposition of SL2(C). For the reader’s convenience, we recall only 
those results in the context of SL2(C) which we need to proceed. For proofs and the 
statements for SL, (C), see the “Notation and Terminology” section of [20]. Let 


v=1(01) 


ae & A | we Ret diagonal elements of SL2(C) with positive 


xe ct. upper triangular unipotent matrices in SL2(C), 


diagonal entries, 


K = SU(2) = {k € SL2(C) | kk* = 1}. 


Here x* denotes the conjugate-transpose x" of x. 
We have the Iwasawa decomposition 


SL:(C) = UAK, 
and the product map U x A x K — UAK is a differential isomorphism. 

The Iwasawa decomposition induces a system of coordinates @ on the symmetric 
space SL>(C)/K. The mapping ¢ is a diffeomorphism between SL2(C)/K and R°. 
The details are as follows. The Iwasawa decomposition gives a uniquely determined 
product decomposition of gK € SL2(C)/K as 

gK = u(g)a(g)K, whereu(g) € U, a(g) € A are uniquely determined by gK. 


Define the Iwasawa coordinates x; (g), x2(g) € R, y(g) € R® by the relations 


w= i x1(g) : las, . wae oa oa) 
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By the Iwasawa decomposition, the Iwasawa coordinates of g are uniquely deter- 
mined. We emphasize that while x; (g) and x2(g) range over all the real numbers, 
y(g) ranges over the positive numbers. As functions on G, x; x2, and y are invariant 
under right-multiplication by K. Thus x;, x2, and y induce coordinates on G/K. 
Now define the coordinate mappings ¢; : SL2(C)/K — R, fori = 1,2,3, by 


og, =—logy, gd2=x1, $3 =X, (32) 


and set 
b = ($1.92, 63): G/K > R’. 


The mapping ¢ is a diffeomorphism of G/K onto R3, because the Iwasawa 
coordinate system is a diffeomorphism, as is log. Thus, there exists the inverse 
diffeomorphism 

¢@!:R? > G/K. 


By (32), we can write, explicitly, 


$ (ti. t,t3) =b+bite“j, forall t = (t1,t2,t3) € R?. (33) 


The quaternion model and the coordinate system on SL2(C)/K. We will use 
the model G/K as the upper half-space H?, defined as the following subset of the 
quaternions: 


H? = {x1 + x2i + yj, where x}, x. € R, ye Rt}. (34) 


Recall that SL2(C) acts transitively on H? by fractional linear transformations. See 
§VL.0 of [19] for the details of the action. We note the relation 


gj = x1(g) + x2(g)it y(g)j. (35) 


As a result of (35) and the Iwasawa decomposition, we may identify SL2(C)/K 
with H?. So ¢ : G/K — R? induces a diffeomorphism 


¢: 13 => R’. 


Because of (35), if g is any element of G such that gj = z, then (g) = (2). 
Further, beause of the way we set up the coordinates on H?, ¢ : H* > R? is given 
explicitly by the same formulas as (32). 
As explained in, for example, §VI.0 of [19], the kernel of the action of SL2(C) 
on H? is precisely the set {+/}, consisting of the identity matrix and its negative. 
For any oriented manifold X equipped with a metric, use the notation 


Aut* (X) = group of orientation-preserving isometric automorphisms of X. 
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It is a fact that every element of Aut*(H*) is realized by a fractional linear 
transformation in SL2(C), unique up to multiplication by +1. Therefore, the action 
of SL2(C) on H? by fractional linear transformations induces an isomorphism 


SLo(C)/{+1} = Aut* (H?). (36) 


The stabilizer in I’ of the first 7 @-coordinates. In all that follows, if i, 7 € N, 
the notation [/, j] is used to denote the interval of integers from i to j, inclusive. 
The interval [i, /] is defined to be the empty set ifi > /. 


Definition 2.10. For i, j € {1,2,3}, withi < j, let pj;,;1 be the projection of H? 
onto the [i, j] factors of R?. In other words, we let 


Pfi,j] = i> Pit1,--+,j)- 
Since ¢ is a diffeomorphism of H?, ¢j;,;; is a smooth epimorphism of H? onto 
Ri-/+1. 
If % is any subset of {1, 2, 3}, of size |.%|, then we can generalize in the obvious 
way to define the smooth epimorphism 


ox > RIM, 


Let I” be a group acting by diffeomorphisms of H?. For y € Ir we also use y to 
denote the diffeomorphism of H? defined by the left action of y on H. Therefore, 
for] € {1,...,3} the composition ¢ 0 y is the R-valued function on H? defined by 


61 0 y(z) = @(yz)_ forall z¢ H’. 


We use I°%./] to denote the subgroup of I” whose action stabilizes the first i 
coordinates. In other words, we set 


res — {y E rr | Pu.j] = Pui] fe) y}. 
We extend the definition of t'./1 to 7 = 0, 4 by adopting the conventions 
reo =P and Peel = 1. 
Note that by definition, we have the descending sequence of groups 
r=r%v0o > r?! > peu > reus > rea = 1. 
Note that the penultimate group in this sequence, namely I%'3J, equals, by 
definition, the kernel of the action of 7 on H°. Assuming that ” C SL2(C), Le., 


that I” consists of fractional linear transformations, we always have 


real = PO {+l}. (37) 
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Because the [”?t./] form a descending sequence, fork, j € {1,2,3} withk < j, 
we can consider the left cosets of "#1 in ’%', The left cosets are the sets of 
the form ly, for yy € Ce, Now let i, j,k € {1,2,3},1 < j,k < j. By 
the definition of '%'./), the function ¢) 0 y; depends only on the left ?-/)-coset 
to which y; belongs. Therefore, for fixed z we may consider ¢; © yx(z) to be a 
well-defined function on the set of left cosets Py, of P94 in '9t.], We may 
therefore speak of the R-valued function ¢; 0 P'%-/ly,. 

In what follows we will most often apply the immediately preceding paragraph 
when / = j andk = j — 1. For y € P'%/-"! and A an arbitrary subset of '%'./), 
we have 

$j (Ayz) = {bj (va}.- (38) 
Therefore, by setting 
$j oly) = 6) (72), 


we obtain a well-defined function 
go Pty: HOR. 


The function ¢; o ”%t'/ly depends only on the [“#t-/]-coset to which y belongs. 
For y € [#t.j—1l, the R-valued function ¢; 0 "'/y gives the effect of the action 
of y € %.J-" on the jth coordinate of a point. It is clear from the definition that 


¢; = ¢; °y if and only if '*"/y is the identity left coset of 7) in Pt-J-", 
(39) 


Sections of projections and induced actions of I’. As before, suppose that I” is a 
group acting by diffeomorphisms on H?, and let '%!') for 7 € {1,2, 3} be defined 
as above. 

For any subset .% of the interval of integers [1,3], we let #° = [1,3] —% be 
the complement of X in [1,3]. 


Definition 2.11. Let f be a real-valued function 
f: oR. 


Let .% a subset of [1,3]. We say that f is independent of the .% coordinates if 
for every x, y € H’, 


bxe(x) = Pxe(y) implies f(x) = f(y). 


In other words, f is independent of the coordinates in .% if and only if f is 
constant on the fibers of the projection ¢~<« onto the R-factors indexed by .%°. 

For the next observation, we need to introduce the notion of a section of a 
projection $7. It will not really matter which section we use, so for simplicity, 
we choose the zero section. For a subinterval [i, 7] of {1,2,3} of size 7 —i + 1, 
define 
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oF] :R/TI+! _, pp 

by 
OF ;\X1, caspase oO Mia eS nO) 
i-l 3-j 

The map OF, ;] 18 called the zero section of the projection $j;,;}. The terminology 
comes from the relation 

bij. j) = Idpi-jt+i, (40) 
which is immediately verified. The concept of the zero section of the projection can 
be generalized from the case of a projection associated with an interval [7, j] to that 
of an arbitrary subset .% of {1, 2,3}, in the obvious way, although we will not have 
any use for this generalization in the present context. 

By use of the zero section, we are able to make a useful reformulation of the 


condition that f : H? — R is independent of the first 7 — 1 coordinates. Let 
j € {2,3} and f areal-valued function on H?. Then 


f is independent of the first 7 — l1coordinates if and only if f oF Pl 73] 
= ou) f (41) 


The reformulation (41) allows us to prove the following result. 


Lemma 2.12. Let A be a group acting on H’, and for j € {1,2,3}, let b1;,3 be 
the projection of H? onto the last 3 — j + 1 coordinates and let One be the zero 
section of $1;,3}. Suppose that for alll € [j,3] and 6 € A the functions ¢) © 6 are 
independent of the first j — 1 coordinates. Then A has an induced action on R3~/*! 
defined by 


Sy.) = o43605,3), forall t=(h,....-j41)€ R/T. (42) 


It is an immediate consequence of the definitions that for any group ¥ acting on 
H? by diffeomorphisms, and any subgroup I” of I”, we have, for 1 <i < j < 3, 


rei = (F/T. (43) 
Applying (43) to the case of ” = SL(C) andi = 1, we deduce that 
ret. = PN SL(C)*-/, (44) 
for any subgroup I” C SL2(C). Because of (44) it is very useful to have an explicit 
expression for SL7(C)*!. We carry out the calculation using the relations of (32). 


Let z € H? with 
Z=HEX+x.4+ yj, 
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as in (34). Let 


g € SL2(C) with g = (: A) 
cd 


Define 


y(z) 


2, 45 
lez + |? wee 


y(c,d;z) = 


where in (45) and from now on, for a quaternion z, ||z||? denotes the squared norm 
of z, so that ||z||? = zz. Then we have 


y(gz) = y(c, d;z). (46) 
For the details of such calculations, see §VI.0 of [19]. Since 
#1 : H’ = Ris defined as — log y(-), 
and log is injective, (46) implies that 
g € SL2(C)® if and only if y(c, d;z) = y(z) forall z € H®. (47) 
By (47) and (45), we have 
g € SL,(C) if and only if ||ez + d|? = 1 forall z ¢ H’. (48) 


Clearly, the condition ||cz + d ||? = 1 is satisfied for all z € H? if and only if c = 0 
and ||d || = 1. We therefore deduce from (48) that 


-1 
wor {(5) 


As a result of (49), we can easily verify that for y € *',/ € [2,3], the functions 
gi © 6 are independent of the first coordinate. So we can apply Lemma 2.12, in this 
case, with 7 = 2 and deduce the following: 


x, we, jlol|=1;. (49) 


Lemma 2.13. Let [ ¢ Aut* (H?), let $p.31 be the projection of H? onto the last 
two coordinates, and let OD 3 be the zero section of 12,3}. Then I” has an induced 


action on R? defined by 


yest) = dp3(vop.y®). forall t= (th, h) € R’. (50) 
The following theorem is a special case of the main result of the first chapter 
of [6]. 


Theorem 2.14. Let I be a subgroup of SL2(C), acting on H® on the left by 
fractional linear transformations. Suppose that I" is commensurable with SLz(Zii)). 
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Let Y be a fundamental domain for the induced action of ['%23)/{+1} on R?. 
Assume further that Y = Int(Y). Define 


F, ={z€ HW | o1(2) < $1 (y2), forall y €T}. (51) 


Set 
F(4) = bn yA. (52) 


(a) We have ¥(Y) a fundamental domain for the action of I’ /{-+1} on HP. 
(b) We have 
F (4) = Int(F(Y)). (53) 


(c) Further, Int(.F;) and Int(.F (Y)) have explicit descriptions as follows: 
Int(F,) = {z € H’ | di(z) < ¢1(y2), forally € FP -T*'} (54) 


and 


Int(.F(Y)) = $p'y(Int(Y)) N Int(F1). (55) 


Considering the coordinate system ¢ on H® as fixed, we may think of the 
fundamental domain .¥ for I"%!'31\I" to be a function of the fundamental domain Y 
for the induced action of ’*! on R*. When we wish to stress this dependence of .F 
on Y, we will write F(Y) instead of F. 


Definition 2.15. Suppose that [ C Aut*(H*) is commensurable with SL2(ZIi]). 
Let Y be a fundamental domain for the induced action of [3i\I"%' on R? 
satisfying Y = Int(Y). Then the fundamental domain ¥(¥Y) for the action of 
I%u31\T defined in (52) is called the good Grenier fundamental domain for the 
action of I on H? associated to the fundamental domain %. 


The reference to the fundamental domain ¥ is often omitted in practice. 
Henceforth, we drop the explicit reference to ”%!'3] and speak of a fundamental 
domain of '%'3|\T" as a fundamental domain of I’. By (37), I’ is at worst a twofold 
cover of [7%l'3)\”, so this involves only a minor abuse of terminology. 
We will give an expression for a good Grenier fundamental domain ¥ (Y) for 
c—'(SO3(Z[i])) in terms of explicit inequalities, in (73), and again as a convex 
polytope in H?, in Proposition 2.19, below. 


Example: The Picard domain ¥ for SL2(Z[i]). Define the following rectangle 


in R*: 
: phage) ; -) . 


It is easy to verify, from an explicit description of SL2(Z[i])*! deduced from (49), 
that %y (zp is a fundamental domain for the action of SL2(Z fi)?! /{41}. 


siz) = iar th) € R* 
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Further, it is obvious that 


Stage = Int(Ge, zips )- 


Therefore, Theorem 2.14 applies. We deduce that with 71, ¥(%,(zjipo) 
defined as in Theorem 2.14, we have 


F ‘= F (Gg, (a4i)01) is a good Grenier fundamental domain for SL2(Zfi)). 


The fundamental domain ¥ is defined in §VI.1 of [19], where, in keeping with 
classical terminology, ¥ is called the Picard domain. 

In order to complete the example, we now give an explicit description of the 
set YF, which will allow the reader to see that “our” ¥ is exactly the same as the 
Picard domain. It can be shown that .¥; is the subset of R* whose image under the 
diffeomorphism ¢~! is given as follows: 


& (A) = {2 € H | ||z— ml] = 1, forallm € Z[ij}. (57) 
Of the infinite set of inequalities defining ;, all except the one with d = 0, 
i.e., ||z||? > 1, are trivially satisfied on $0.5) (Sei ati ). Thus, from (57), (56), and 


(52), we recover the description of the Picard domain by finitely many inequalities 
given in §VI.1 of [19]: 


F (Ss, (zpie1) = ‘2 ¢ H’ 


€ La elo, Iz? > 1 (58) 
a 975 |? 22 = eg eee See 


2.3 Explicit description of the fundamental domain 
for the action of SO3(Z[i]) on H? 


We now proceed to consider the special case of e~'(SO3(Z[i])) in Theorem 2.14 
above. In keeping with the general practice of the present paper, we will go back 
to using G to denote SO3(C) exclusively, and I" to denote the group SO3(Z|i]). 
Since we are always in this section in the setting of subgroups of SL2(C), we will 
abuse notation slightly and use I” to denote the isomorphic inverse image ec! (I”) 
of I” = SO3(Z|i]) in SL2(C). 

Also, we treat R?, the image of the projection $12.3], as C, by identifying the point 
(t),t2) € R* with 4) + it). Thus, our “new” ¢i2,3] is defined in terms of the “old” 
¢-coordinates by 


12,3] (Z) = G2(z) + if3(z). (59) 


Proposition 2.16. First form of Y,. Let YF, be as defined in (51). All other 
notation has the same meaning as in Theorem 2.14. Then we have 
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Fi ={2=x@+y@je HW | |lx@-Ad|?+y@* = 2, ford €1+(1+i)Z fi}, 

(60) 
and Int(.¥;) is the same as in (60), but with strict inequality instead of nonstrict 
inequality. 


Fundamental domain % for '*!. In order to complete the explicit determination 
of a good Grenier fundamental domain ¥ for I”, it remains to describe a suitable 
fundamental domain Y for "®'. Using (44), (49), and the description of I" in (27), 
we deduce that 


5 )5 
re = & a | be(1+HZ[i], de ont (61) 
8 


It follows from (61) that the subgroup of unipotent elements of I"?! is 


ay — (1 0+Zfi] 
ee = i 1 ) . (62) 


We make note of certain group-theoretic properties of '%! and (I”%')y that are used 


in determining the fundamental domains. First, we define the following generating 
elements: 


wg 0 11+i 11-i 
Ri = Ti44 = : d Tis= . 
(Sop) Peal tT!) me tae ('T) © 


It is easily verified, using (61) and (62), that 
)y = (Ti, Tia), Fe = (Rez, Ti, Ti). (64) 
We calculate, from the definition of Rz and (64), that 
(RC y = (Fu. 
Since I"?! is generated by (I"?')y and Rz, and Rz has order 4, we deduce that 
(7*)y is normal in 7® with [7" : (F™)y] = 4. (65) 
Let T be any element of (I”?')y. Then we have a more precise version of (65): 


The group (7R =) of order 4 is a set of representatives 


for the coset group?! /("®!)y. (66) 


98 E. Brenner and A. Sinton 


Applying (66) to the case T = T\_;, we have the following: The group (TiiRz) 
of order 4 is a set of representatives for the coset group?! /(I"*')y. 
(67) 
It is easily verified that the action of Rz on C is rotation by an angle 7 /2 about 
the fixed point 0. Furthermore, we calculate from (63) that 


T\iRz = c(7))Rz. 


Therefore, 


The action of T\iRz on C is rotation by 2/2 about 1. (68) 


The following statement is a special case of Lemma 2.2.7 of [2]. 


Lemma 2.17. Let Gy be a fundamental domain for the action of (®')y on H? 
satisfying 

Ti 4iR2() = W. 
Let Y be a fundamental domain for the action of (T\4iR«x) on GY. Then G a 
fundamental domain for the action of '?! on H?. 


In order to define and work with the sets G and Y which will be fundamental 
domains for the action of I; : ' and P’?!, it is useful to introduce the notion of a 
convex hull in a totally geodesic metric space. 

A metric space (X, d@) will be called totally geodesic if for every pair of points 
Pi, Pp2 € X, pi # po there is a unique geodesic segment connecting pi, p2. In 
this situation, the (closed) geodesic segment connecting p1, P2 will be denoted by 
[P1, Pola. A point x € X is said to lie between p; and pz when x lies on [p1, pola. 
We then say that Y C X is convex when p), p2 € @ and p3 between p; and po 


implies that p3 € 7. Let pi,..., p, be r points in X. The points determine a set 
Ca(Pi,---s Pr): 
called the convex closure of 71,..., p,, described as the smallest convex subset of 


X containing the set {p1,..., Dr}. 

Obviously, we can apply the notion of convex hull to any set .”, rather than 
a finite set of points. The definition remains the same, namely that @(.%) is the 
smallest convex subset of X containing .”. In general we will use the notation 


lS) = te U a) 


In particular, if we apply these notions to X = R? with the ordinary Euclidean 
metric Euc, then the geodesic segment [p1, P2]ruc is just the line segment joining 
P1, P2. Further, provided that not all the p; are collinear, @(p1,... p,) is a closed 
convex polygon whose vertices are located at a subset of {p1,..., pr}. 


Some aspects of harmonic analysis on locally symmetric spaces 99 


We first use the notion of convex closure to record an elementary facts concerning 
the fundamental domains of groups of translations acting on R?, identified with C 
in the usual way. Let @,,@2 € C be linearly independent over R. Then Za, + Za 
is a lattice in C, and it is well known that all lattices in C are of this form for suitable 
@ , @2. Let T denote the group of translations by elements of Zw, + Z@2 acting on 
C. Then we have 


€(0, @1, @2, ©; + w2) is a fundamental domain for the action of Z@, + Zw on C. 


(69) 
Now we define the following polygons in C & R?. Let 
Gy = Cuc(0,2,1+i,1—-i), 
and let 


The relation between the polygons is that % is a square centered at 1, while ¥ is 
an isosceles right triangle inside %, with vertices at the center of Y and two of the 
corners of yy. Therefore, it follows from (68) that we have 


G = (J Msiks)Y, with (T1iR2)'YOY C dG, fori #0 mod 4. 
i=0,1,2,3 
(71) 
The relations (69) and (71) lead to the following lemma. 


Lemma 2.18. Let '®' be as given in (61) and ('®')y as given in (62). 


(a) The set Gy is a fundamental domain for the induced action of ('*)y on 
CR’. 

(b) GF is a fundamental domain for the induced action of (T\4iRz) on GW. 

(c) The set Y is a fundamental domain for the induced action of '*! on C = R?. 


Form of .¥ in terms of explicit inequalities. Combining Part (c) of Lemma 2.18, 
Proposition 2.16, and (52), we deduce that 


F(G) = {2€ HW | op. 3(2 € Gal, 2,149, |x(2) — ml)? + yp’ = 2, 
form é€1+ (1 +i)Z[i}. 


By (59), the first condition in the description of ¥ (Y) above may be replaced by 
x(z) € @puc(1, 2,1 4+ i). (72) 


Let z € C satisfy (72). The element m = 1 is the element of 1 + (1 + i)Z[i] closest 
to x(z). Therefore, for z satisfying (72), the condition 


\|x(z) — m||? + yz)’ = 2, forallm €1+(1+iZii] 
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reduces to ||x(z) — 1||? + y(z)? = 2. So we may rewrite the description of F(Y) in 
the form 


F(G) = {2 € W x(z) € Gauc(1,2,1 +i), |[x(2) — 1]? + y@? = 2}. (73) 


Additional facts regarding convex hulls and totally geodesic hypersurfaces 
in H3. We now extend our “geodesic hull” treatment of Y from the boundary into 
the interior of H*. We first recall certain additional facts regarding convex hulls and 
totally geodesic hypersurfaces in H?. 

The description of the geodesics in H* is well known, but the corresponding 
description of the totally geodesic surfaces in H? perhaps not as well known, so we 
recall it here. Henceforth we abbreviate “totally geodesic” by t.g. Although all t.g. 
surfaces are related by isometries, in our model they have two basic types. The first 
type is a vertical upper half-plane passing through the origin with angle 0 measured 
counterclockwise from the real axis, which we denote by H?(6). The second type 
is an upper hemisphere centered at the origin with radius r, which we will denote 
by S*(0). The t.g. surfaces of H? are the H?(@), the S*(0), and their translates 
by elements of C. For each of the basic t.g. surfaces, we produce an isometry g € 
Aut(H?), necessarily orientation-reversing, such that Fix(g) is precisely the surface 
in question. The existence of such a g shows that the surface is a t.g. surface. 

We define 

H? = HUCUc 
to be the usual closure of H? and extend the action of fractional linear transforma- 
tions and the notion of the convex hull in the usual way. For any subset .Y of H?, 
YF will denote the closure in H3. For g € Aut(HP), we will likewise use g to denote 
the extension of g to the closure H®. Henceforth, we will work exclusively in the 
setting of the closure H? of H®. Thus, we will actually identify the closures of the 
t.g. surfaces. 

The basic orientation-reversing isometry of H may be denoted by R*. With 
xy + xvi+ yj Ee H?, we have 


R* (x, + xoi + yj) = x1 — Xoi + yj. 


Clearly, we have Fix(R*) = H2(0). To obtain isometries corresponding to the other 
vertical planes, let 
i0/2 
Ro = ( 2 ) 
0 et {2 


Because RgH?(0) = H2(0), we have 


Fix(e(Ra)R*) = H2(6). 
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To define the isometry J such that Fix(/) is the basic hemisphere Se (1), let 
Z denote the conjugate of the quaternion z, 1.e., if z = x; + x21 + yj, then Z = 
xX, — X2i — yj. For z € H’, set 


T(z) = 1/z. 


We have the equality z/Z(z) = ||z||?. Observe that S{(0) is precisely the set of 


quaternions in H® of norm one. Thus, Fix(/) = Sf (0). For the more general 


hemispheres S* (0), set 
Jr 0 


Vr 


Then, since A(r)S;* (0) = S+ (0), we have Fix(e(A(r))/) = S*(0). 
In order to denote the convex hull in H?°, we use the notation @y. Therefore, if 
ds” is the hyperbolic metric on H?, we have 


Ca(Pi,.--, Pr) = Cas2(Pi,- +++ Pr)s 


in terms of our original notational conventions. 
Let pi,..., py € H’, forr > 3, not lying on the same totally geodesic surface, 
such that foreachi, 1 <i <r, 


Di € Cu(Pi,---. Pi-1, Pitt,-+-+ Pr): 


Then the set G@y(~1,..., P,) will be called the solid convex polytope with vertices 
at pi,..., py. It is clear that for any p1,...p, € H not lying in the same totally 
geodesic surface, @y(p1,..., Pr) is a solid convex polytope with vertices consisting 
of some subset of the r points. 


Description of 7 (7) as a solid convex polytope. 


Proposition 2.19. The solid convex polytope with four vertices in HS given by 
F(G) = Gq + V2j,2 + 5,1 +i + j, 00) (74) 
is a good Grenier fundamental domain for the action of ! = e~'(SO3(Z[i])) on HS. 


In the interactive graphical representation, [3], (GY) is the solid region inside 
the triangular prism and above the red sphere. 
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2.4 SO(2, 1)z as a group of fractional linear transformations 


We will now use the results of §2.1 and §2.2 to deduce a realization of Tz = 
SO(2, 1)z as a group of fractional linear transformations, as well as a description 
of a fundamental domain for Iz acting on H* that is in some sense (to be explained 
precisely below) compatible with the fundamental domain of I" acting on H?. 

We maintain the notational conventions established in §2.1. In particular, G = 
SO3(C) and I” = SO3(Z[i]). It is crucial, for the moment, that we observe the 
distinction between G, I and their isomorphic images under c~!. 


Definition 2.20. Set 
Tz, = e(SL2(R) Ne (L)). (75) 


Remark 2.21. Note that the elements of Iz do not have real entries! The naive 
approach to the definition of [72 would be to take the elements of J” with real 
entries, as in the case of SL2(Z[i]) and SL2(Z). However, this clearly cannot be the 
right definition because the resulting discrete group would be contained in SO(3), 
hence compact, and hence finite. The justification for Definition 2.20 is contained 
in Proposition 2.22, below. 


Recall the orthonormal basis 6 for Lie(SL2(C)) defined in (4). Define a new 
basis 7 by specifying the change-of-basis matrix 


a?" — diag(1, —i, 1). (76) 


Let Vz be a real vector space of dimension 3. Let SO(2, 1) denote the group of 
unimodular linear automorphisms of Vg preserving a form Bg on Vg of bilinear 
signature (2, 1). For definiteness, we will take 


Vg = R-span(7) C Lie(SL2(C)), Br = Blrg, 


where f’ is the basis of Lie(SL2(C)) defined at (3), and B is as usual the Killing 
form on Lie(SL2(C)). From the fact that 6 is an orthonormal set under B and from 
(76), it is immediately verified that B|g has signature (2, 1). Note also that 


V := Ve ®C = Lie(SL2(C)). 
By considering SO, as a subset of GL3(R) we obtain the standard represen- 
tation of SO(2, 1). We define SO(2, 1)z to be the matrices with integer coefficients 


in the standard representation of SO(2, 1). 
Recall from (8) the definition of the morphism 


Cy = Cy, : SLo(C) > SL3(R). 
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Proposition 2.22. Let Iz be as defined in (2.20). Then the restriction of ¢, to Vz 
provides an isomorphism 


c, : SL2(R)/{+1} > SO(2, 1)° (77) 


of Lie groups. The isomorphism of (77) further restricts to an isomorphism of 
discrete subgroups 


ce, :¢ (Iz) > SO(2, 1)z. (78) 
As a result, we exhibits an isomorphism 


Tz = SOQ, 1)z. (79) 


The next proposition, Proposition 2.24, is the analogue of Proposition 2.8 for 
the real form of the complex group. Proposition 2.24 below is, in contrast, almost 
a triviality to prove at this point, since it can be deduced rather readily from 
Proposition 2.8. 

For Proposition 2.24, it is necessary to recall the &-subsets of SL2(ZIi]) defined 
in (22) and (23). For each of the three &'-subsets, we define 


(5)z = ENSL2(R). (80) 


The following result both justifies this notation and clarifies the meaning of 
Proposition 2.24, below. 


Lemma 2.23. Each (&)z-group can be given the following description: 


(81) 


In order to obtain ©) in this manner, we may take, in (81), 
(pq) =(10) and (F5)=(01). 
Further, we may take 
(pq) = (1 1). inorder to obtain 2, and Ey, 


and 


al 
ee 


Ts)= (0 1) , in order to obtain &\, 


— 


rs 


— 
— 


= (1 0) , inorder to obtain &). 


104 E. Brenner and A. Sinton 


Proposition 2.24. With Iz defined as in Definition 2.20, we have 


ce !(Iz) = (212)z 'e (Ga (( re . (82) 


From (82), we deduce the following analogue of Lemma 2.9 


Lemma 2.25. Let c~! (Iz) be the discrete subgroup of SL2(R) defined in (75), and 
given explicitly in matrix form in (82). All the other notation is also as in Proposition 
2.24. 


(a) We have 
ce 1(1z) 1 SL2(Z) = (S2)z. 


(b) We have 
[ce (Iz) : (812)z] = 2, [SLo(Z) : By] = 3. (83) 


Explicitly, a representative of the unique nonidentity right coset of (&\2)z in 
cl) is 
1 f/10\/f1-1\_ 1 /f1-i1 
alii)o2)= ala): 


2.5 Fundamental domain for SO(2, 1)z acting on H? 
and its relation to that of SO3(Z{i]) 


The main point of this section is that, provided the fundamental domain % of the 
standard unipotent subgroup of e~!(I7z) is chosen in a way that is compatible with 
the choice of ¥ in (70), then the good Grenier fundamental domain .7p (Ye) for 
c'(Iz) corresponding to Ye will have a close geometric relationship to .F(¥). 
Based on the classical example of Dirichlet’s fundamental domain for SL2(Z) acting 
on H? and the Picard domain, one might guess that we would have the equality 


Fe(Ge) = F(G)OW. (84) 


In fact, this intersection property cannot hold, because of the presence of additional 
torsion elements (the powers of wg/z) in c!(I’). However, in a sense which will 
be made precise in Proposition 2.27, below, the next best thing holds. Namely, 
the intersection of the set consisting of two I’-translates of F(Y) with H? equals 
Fp (Ge), for the choice of Gp in (85), below. 

In the case of Tz, C Autt(H’), commensurable with SL2(Zfi]), we have the 
obvious analogue of Theorem 2.14, defining a good Grenier fundamental domain 
for the action of Iz. In order to distinguish the real case Tz, C Aut*(H*) from 
the complex case, we add the subscript R to the sets Y ¥,, F(Y), and so write 
Gy Fp, Fp(YR). In this case, the good Grenier fundamental domain coincides 
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with the classical notion of the Ford fundamental domain for a discrete subgroup of 
Autt (H?) of finite covolume. See, for example, [16], p. 44. However, we use the 
terminology Grenier domain even in this context, in order to stress the eventual 
connections with the higher-rank case. 


Explicit Descriptions of Gg and 7g(Yp). 
Lemma 2.26. (a) We have 


(b) The interval 
Gy := [0,2] (85) 


is a fundamental domain for the action of Ug on R satisfying 
Gp = IntGe. 
(c) With Gg as defined in (85), part (b) implies that 


Fe (Ge) ={z € WH | 0 < x(z) <2, y()? + (x — 1)? = 2} = Gy, 2 +i, 00). 
(86) 


Geometric relation of 7g(Yg) to ¥(Y). In order to relate the fundamental 
domain of a subgroup of SL2(IR) acting on H? to the fundamental domain of a 
subgroup of SL2(C) acting on H?, we consider H? embedded in H? as the totally 
geodesic surface H*(0). Note that 


H?(0) = {xi+ yj | y > 0}, 


and the actions of SL2(R) on H? and H?(0) are equivariant with the obvious 
isomorphism 


Ww —; H?(0), mapping x + yj xi + yj. 
Under this isomorphism of SL2(R)-homogeneous spaces, -Fx (Ye) corresponds to 
€4(j, 2i + j, 00) in H’(0). (87) 
Because of the isomorphism, we can safely ignore the distinction between the forms 


of Fp(Ge) in (86) and (87). 
Because, as can be verified readily, 


Fe = (g Ue(T1) (R) g) n HR, (88) 
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we cannot hope that we will have the straightforward relation 
Fe (Ge) = F (G) WF 


that we find in the classical case of SL2(Z|i]) and SL2(Z). However, we do have the 
next best possible relation between the fundamental domains. 


Proposition 2.27. We have the relation 
F (Ge) = (FG) Ve(T) (RE) FG) OHG. 


Remark 2.28. We note for possible future reference that Ap(Yg) is the normal 
geodesic projection of the union of A(¥Y) and one translate ¢(7;) (R2) F (GF) 
2 


of ¥(Y). This relation between the fundamental domains is connected to the one 
given in Proposition 2.27, though neither relation implies the other, in general. The 
relation may be of some use in relating spectral expansions in the complex case to 
spectral expansions in the real case. We explored one aspect of this topic in §1 of 
this paper, above. 


In the interactive graphical representation [3], the front face of the “cut-off prism” 
solid amounts to one half of Ag(Ye). 


2.6 Spectral zeta functions 


This section discusses a potential application of the results of this paper and 
indicates a future line of investigation building on this work. Jorgenson and Lang, in 
works such as [17], [19] (see the introduction to the latter work especially), and [20], 
have laid out and begun to pursue an ambitious program of using heat kernel analysis 
to associate additive spectral zeta functions to quotients of symmetric spaces. When 
completed, this theory would subsume the basic theory of the Riemann zeta function 
and Selberg zeta function (among others), and clarify the relationship between the 
zeta functions arising at different geometric levels. The main component of the 
program is obtaining a theta inversion formula. 

In [19], which carries out the derivation of the theta inversion formula for the 
special case of 


X =I\G/K = SLo(Zfi])\SL2(C)/SU(2, C), 
the authors compute the regularized trace of an integral operator on functions on X. 


The kernel of the integral operator is K, y (z, w), the heat kernel on X. The trace of 
such an integral operator is defined to be the integral on the diagonal 
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/ K,.x (z, z) dz. 
X 


Although this integral is infinite, because of the cusp of X, the integrals over sets 
Xy approximating by covering X only up to some fixed finite “distance” in the cusp 
are finite and diverge logarithmically in Y . That is, 


lim / K,.x (z, z) dz — c1(t) log Y exists as a C-valued function of tf, (89) 
Xy 


Yoo 


where c(t) is a factor, constant in Y, and determined in [19]. For the purposes 
of such an integration, we can replace X with a suitable fundamental domain .F. 
Similarly, we replace the truncation Xy of X with a matching truncation Yy of ¥. 
To obtain the theta inversion formula, the limit of (89) is computed in two ways. 
One computation is from the expression of the heat kernel as the periodized heat 
kernel on the universal covering space H*. This method of computing the limit in 

(89) yields 
e741) 2 ON (1/1) + OW(1/1). (90) 


In (90), ONC (1 /t), the inverted theta series, is defined in terms of invariants of 
certain I"-conjugacy classes in I", while O“S(1/t), the inverted theta integral, is 
a sum of products composed of special values of fg), constants similar to Euler’s 
y, and single integrals whose Gauss transforms are exact. (We refer to §XIV.7, of 
[19], for exact definitions of ON°(1/1r), @™S(1/t) and the other terms in the theta 
relation.) The other method of computing the limit in (89) is from the expansion of 
K,. x (z, z) dz in terms of the spectrum of the Laplacian Ay. This second method of 
computing the limit of (89) yields 


cus (t) =F 1 + Ogis(t), (91) 


where Ocys(t) is the theta series i ej! anda j are the eigenvalues of Ay, and 
Ogis(t) is what remains as the limit of the integral of the convolution of K,_y (z, w) 
with certain Eisenstein series once the term c;(t) log(Y ) has been subtracted. Setting 
equal the two expressions (90) and (91), for the same limit (89), we obtain the theta 


inversion formula for X: 
e(4t) 2 ON(1/t) + ON (1/1) = Ocuslt) + 1+ eis). 92) 
Next, note that there is an infinite sequence of arithmetic quotients 
X, = SL, (Zfi])\SL,(C)/SU(M), n> 1, 


having X = Xp) as its first nontrivial member. Generalizations of (92) to X;, 
for n > 2 are discussed in [20]. In order to obtain exact formulas analogous to 
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(92), we would have to integrate over a fundamental domain, rather than over an 
appoximating Siegel set, which is a more common analytic model in the literature. 

In the present work, we initiate an extension of the Jorgenson—Lang project to 
the sequence of arithmetic quotients 


SO, (Zfi])\SO;(C)/SO(n) (93) 


and related arithmetic quotients of real forms of the symmetric spaces. The main 
results of the present paper are restricted to the group theory (Propositions 2.8 
and 2.24) and fundamental domains (Propositions 2.19 and 2.27) in the first case 
of n = 2. Nevertheless, some of the intermediate results are couched in a more 
general terminology and notation, with a view towards building upwards from the 
case n = 2, to the case of a general n. Thus, our project includes a natural extension 
and generalization of Grenier’s work in [13] and [14] to a different sequence of 
symmetric spaces. 
The identification 


SL2(C)/{+1} —> $O3(C) 


allows us to view the theta inversion relation (conjecturally) associated with the 
case n = 2 in (93) as a theta inversion relation associated with a quotient of 
SL2(C)/K by an arithmetic subgroup different from, but still commensurable with, 
the “standard” arithmetic subgroup SL2(Z|i]). The results of this paper will, it is 
hoped, enable future investigations to apply the machinery developed in [19] to 
this “nonstandard” arithmetic subgroup c~!(SO3(Z[i])) of SL2(C) to obtain the 
corresponding theta function. 
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Dedicated to the memory of Serge Lang 


Abstract The é-characters of an abelian variety [B95] are arithmetic analogues 
of the Manin maps [M63]. Given a smooth projective curve X of genus at least 
2 embedded into its Jacobian A, one can consider the restrictions to X of the 6- 
characters of A; the maps so obtained are referred to as 5-characters of X. It is 
easy to see that the -characters of X have a remarkable symmetry property at the 
origin. The aim of this paper is to prove that this symmetry property completely 
characterizes the 5-characters of X. 


Key words curves ¢ jacobians ¢ local fields * fermat quotients 
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1 Introduction 


In [M58, M63] Manin introduced some remarkable additive characters of the 
group of points of an Abelian variety defined over a complex function field; these 
characters are referred to as Manin maps and were used by Manin in his proof of the 
complex function field analogue of the Mordell conjecture [M63]. Manin’s result 
had been conjectured by S. Lang in [Lan56]. In [B95] arithmetic analogues of the 
Manin maps were introduced; they were called 6-characters and in their definition 
Manin’s horizontal differentiation was replaced by a Fermat quotient operator, 6, 
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. . —nP . . 
which on integers acts asn b én := a where p is a fixed prime. More 


A 


generally, if R := Ze. is the completion of the maximum unramified extension 
of the ring Z, of p-adic integers, then one considers the operator 5 : R > R 
defined by 


_ px) — x? 
at 


where ¢ : R > R is the unique lift of the p-power Frobenius map on k := R/pR. 
(Morally one views 6 as an arithmetic analogue of a derivation.) Then, if X is a 
smooth scheme over R and f : X(R) — R is a function, one says that f is a 
6-function of order r if for any point in X(R) there is a Zariski open neighborhood 
U Cc X,aclosed embedding u : U Cc AN, and a restricted power series F with 
R-coefficients in (r + 1)N variables such that 


6x: 


S(P) = FUu(P), 6(u(P)),...,6"(u(P))), Pe U(R). 


Here we view u(P) € A(R) = RN and we say F is restricted if its coefficients 
tend to 0. If G is a group scheme over R, then by a 6-character one understands a 
6-function wy : G(R) > R which is also a group homomorphism into the additive 
group G,(R) = R. For the construction of 6-characters on Abelian schemes we 
refer to [B95]; there always exist 5-characters of order 2, there sometimes exist 5- 
characters of order 1 and, of course, there are no non-zero 6-characters of order 0. 
Note that 6-characters were used in [B95] to prove an arithmetic analogue of the 
Manin theorem of the kernel [M63] and they also have a number of applications to 
dynamical systems and modular forms; cf. [BO5] and the bibliography therein. 
Now if X is a smooth projective curve over R of genus g > 2, then one can 
compose the 6-characters of the Jacobian of X with Abel-Jacobi maps of X into 
its Jacobian. What one gets are 5-functions X¥(R) — R which we shall call 6- 
characters of X. If f is a é-character (corresponding to an embedding of X into its 
Jacobian that sends the point P° into the origin), then the “formal germ” of the map 


X(R) x X(R) > R 


(Pi, P2) +> f(Pi) + f(P2), (1) 


at (P°, P°) satisfies a certain symmetry property which we shall refer to as 6- 
symmetry. The aim of this paper is to prove that, somewhat surprisingly, the converse 
of the above is also true: if a 6-function f : X(R) > R vanishing at a point P° is 
such that the “formal germ” of the map (1) at (P°, P°) is 5-symmetric, then f is 
a 6-character. What we have here is that formal behavior of a function at one point 
guarantees the correct global behavior of the function. 

We refer to the body of the paper for the precise definition of 6-symmetry and 
further discussion of this concept. We content ourselves here with a few remarks: 


Remark 1. (1) For 6-functions f of order 0 (ie., for functions f arising from 
usual algebraic geometry) on an affine curve X the é-symmetry of the formal 
germ of the map in Equation (1) will be automatic; it will be a consequence of 
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the usual fundamental theorem of symmetric functions. However the analogue 
of the fundamental theorem of symmetric functions fails for 6-functions of order 
> 1 and consequently 6-symmetry of formal germs of 5-functions will be far 
from automatic. 

As we shall see, there exist 5-functions X(R) — R which are not p-adic limits 
of polynomials in 6-characters. 

In [B08] it is shown that 6-symmetry, in the special case of modular curves, is 
closely related to the action of the Hecke operator T (p) on 6-modular forms (in 
the sense of [B05]). 


es 


wa 


(3 


wm 


In Section 2 below we will introduce the main concepts and notation and we will 
state our main theorem. Section 3 is devoted to the proof of the theorem. 


Acknowledgement While writing this paper the author was partially supported by NSF grant 
DMS 0552314. 


2 Main concepts and statement of the theorem 


Let R Ze, k = R/pR, ¢,6 : R — R, be as in the introduction. We denote 
by K = R[1/p] the fraction field of R. We will assume throughout the paper 
that p # 2. For X a smooth scheme over R we denote by O"(X) the ring of 6- 
functions X(R) — R of order r. For G a smooth group scheme over R we denote 
by X’(G) the R-module of 6-characters G(R) — R of order r. There are natural 
inclusions O'"'(X) Cc O'(X), X"-!(G) Cc X"(G). Also there are natural maps 
6: O'1(X) = O'(X), f & 80 f,and¢ : X""(G) > X"(G), VR dow. 
Recall from [B95] (or [B05], Definition 3.5) that there exists a projective system of 
formal schemes 


II 


Lp PO) SP Se OO) =, 


called the p-jet spaces of X, such that the ring O" (X) naturally identifies with the 
ring O(J"(X)) of global functions on J’ (X). Here * stands for p-adic completion; 
on the other hand, completion with respect to a maximal ideal will be denoted, as 
a rule, by the superscript for. By a 6-morphism f : X — Y of order r between 
two smooth schemes we understand a morphism of formal schemes J’(X) — Y. 
The latter induces unique morphisms of formal schemes J’t'(X) — J°(Y) that 
commute (in the obvious sense) with 6; this allows one to compose 5-morphisms 
of orders r; and r2 to get a d-morphism of order r; + rz. Note that the set of 6- 
morphisms X — A! of order r identifies with the set of 5-functions X(R) > R of 
order r. We shall repeatedly use the fact ([BO5], Corollary 3.16) that if U C X is 
open, Xo := X @k is irreducible, and Up 4 M, then O" (X) > O"(U) is injective 
with torsion-free cokernel. 
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Definition 1. Let X be a smooth projective curve over R of genus g > | and let 
P® € X(R) be an R-point. A é-morphism f : X — A! is called a 5-character 
centered at P° (of order r) if 


f=yVoB, 


where B : X > A := Jac(X) is the Abel-Jacobi map P +» O(P — P°®) and 
w:A— G, =A! is a 6-character of A (of order r). 


Remark 2. It follows from [B95], pp. 325-326, that the R-module of 5-characters 
of order r on X centered at P° has rank between (r — 1)g and rg. Note that any 
6-character on X centered at P° vanishes at P° and actually vanishes on the set 
of torsion points X(R)jors = B~'(A(R)tors). More generally any 5-character on X 
takes the same value at P,Q € X(R) whenever P — Q is torsion in the divisor 
class group. In particular the p-adic closure C’(X) of the R-subalgebra of O" (X) 
generated by all 6-characters of X (centered at various points) does not separate 
points whose difference is torsion in the divisor class group. On the other hand if 
the genus is g > 2 andr > 1, then by [B96], p. 365, O"(X) separates the points 
of X(R). So, for all r > 1, we have C’(X) 4 O'(X) provided there exist distinct 
points P, OQ € X(R) such that P — Q is torsion in the divisor class group. 


Let T = {T',..., 7%} bea g-tuple of variables. (Later in the paper g will be the 
genus of our curve.) By a (smooth) local formal scheme (of relative dimension g) 
we understand a formal scheme of the form X'" ~ Spf RI[T]]. If T’,...,T” are 
additional g-tuples of variables, then the local formal scheme 


J’ (X™) = Spf RIIT,...,T?] 
will be referred to as the p-jet space of X' of order r. One defines the maps 
6: R{[T,..., T°] > RI[T,..., TOF] 
by the formula 


FY (T? + pT’,...,(T)? + pT) — F(T,...,T®)? 
8(F(T,...,T®)) := Vee? (rm)? +P ees Lae 
P 


the upper (¢) means “twisting coefficients by ¢”. A 6-morphism X'® — Y'* of 
order r is a morphism of formal schemes J"(X'") — Y'*, Any such morphism 
induces unique morphisms J™*!(X'") — J'(Y'") for all i > 0 such that the 
corresponding ring homomorphisms commute, in the obvious sense, with 6. This 
allows one to define, in an obvious way, the composition of two 6-morphisms of 
local formal schemes. 

Let F € RI[T,, To]]* be a formal group law in g variables (always assumed 
commutative); the local formal scheme G'™ := Spf R{[T]] equipped with the 
formal group law G' x G' —> G' defined by F will be referred to as a local 
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formal group. In particular we denote by G*" = Spf R[[z]] the local formal group 
defined by the formal group law z; + z2. By [B05], Proposition 4.39, for any local 
formal group G'™ and any r > 0 we may consider the formal group law in g(r + 1) 
variables 


(F,6F,...,8°F) € RIT, T’,..., Tr ]ECr, (2) 


(That this tuple is a formal group law follows from an obvious universality property 
argument.) The formal group law (2) defines a local formal group J’(G®") which 
can be referred to as p-jet space of G'™ of order r. By a 5-character G'* + Gi 
of a local formal group G* we understand a 5-morphism G'*" — G* which is 
compatible in the obvious sense with the group laws. 


Remark 3. Assume g = 1. If ¥ has coefficients in an unramified extension O of 
Z, with residue field O = O/pO of size p” and if F € O[[T,, To] has either 
infinite height or height , then there always exists at least one non-zero 6-character 
Gi + G' of order < vA; this is a consequence of the proof of Proposition 4.26 
in [B05]. 


Definition 2. A 6-morphism 
} : xfer = Ai for = Spf Ri[z] 


from a local formal scheme will be called a 5-character if there exists a morphism 
Bi ; xfer —, Gl into a local formal group G'™ and a é-character w* : Gi > 
G* = Aler such that 


for __ for for 
f= ep. 


Let t),...,%m be a tuple of variables and s1,...,5,, another tuple of variables, 
m > 2. Let S; := Sj;(t1,....tn) be the fundamental symmetric polynomials in 
Ugs-iceg bys 
S)=t) te:-+tye: ++. Sm = t..-tn. 


Remark 4. The R-algebra homomorphism 
Rioescs Sse Oe ee OO S Rishi OM °C) 
sy) > Sj, 


is injective. Indeed it is enough to check the corresponding statement with R 
replaced by K, the quotient field of R. But 


Kitiscattiniotst ee COS Ke ste eo Ghat Gl oC 


and similarly for the s;’s. Also we have an equality of ideals 


CneccsPivash, aid) = (t1,--.stns.+-.P' (t),-.-5%' (tn)) 
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so Equation (4) holds with polynomials replaced by power series. So we are led to 
check that 


K[[s1, siesd Ge Spisce se .d' (s1), was 2’ (Sm)]] — K{[h, seve bypgin se P(t), eee »P’ (tn)]] 


is injective. We claim the latter map is faithfully flat and this will prove its injectivity. 
Now the claim follows from the fact that the inclusion 


K[s1,...,5m5--+,0' (51), ---5 0" (Sm)] > K[t,.-. tne. O'(h),--- O' (tn)] 


is finite and flat and the ideal (f),...,tn,...,@'(t),..-,@'(tm)) is the only 
maximal ideal above (s|,...,5n,---,9'(51),---, 0" (Sm)). 
In what follows we shall view the map (3) as an inclusion. 


Definition 3. A series in R[[t1,....tm.-...t(?,...,04?]] is 6-symmetric if it is in 
the image of R[[s1,....5m.....9?,. 6. 88 J]. 


Let ¢ be a variable. For any series F € R[[t,t’,...,t]] and any m > 2 one can 
consider the series 


m 


a > we (Gitiaoatt”) ER [eee eer ale 


i=l 


These series are not 5-symmetric in general. For example, the series Y(t’)? is not 6- 
symmetric. On the other hand, for instance, Y,,t’ is 5-symmetric for any m > 2. An 
obvious class of examples of series F' such that %),, F are d-symmetric is provided 
by the series in 


>> ¢' CRILeM). 
i=0 


A less obvious class of examples of series F such that &),, F are 6-symmetric is 
provided by Lemma | below. Let us also note that if 2’, F is 6-symmetric then ¥; F 
is 5-symmetric for all i < m. Also it is worth noting that if F € R[t,t’,...,¢7, 
then one always has 


DinF € Riis ons tagicu, ike go Ay 


“mM 


where 
A:= [[@ = tj)” E Rs, ea Sin 
i<j 
is the discriminant polynomial; this is an immediate consequence of [B05], Proposi- 


tion 3.27. So, morally, what prevents X,, F from being 5-symmetric is the possible 
“occurrence of A in the denominators”. 
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Definition 4. Let ff : X' — Al?" = Spf R|[z]] be a 5-morphism with X'* 
of relative dimension | and let 


for forym ey 1, for\m mle 1,for 
Din f (Xy" — (A yy >A 


be the induced 5-morphism, m > 2. Fix an isomorphism X' ~ Spf R{[t]] and 
consider the series 
FPF re RE ccc], 


We say that ¥,, f!° is 6-symmetric if the series 


Sof 0] Rilfiscitanend eek | 
is 6-symmetric. (The definition does not depend on the choice of the isomorphism 
X'" ~ Spf Rift.) 


Now if X is a smooth curve over R and P® € X(R) is a point with reduction 
mod p denoted by Py € X(k), then the completion X' of X at P)) is isomorphic 
to Spf R|{[t]] so it is a local formal scheme. Fix such an isomorphism and let 


i272 3h 


be a 6-morphism with f(P°) = 0. Then f naturally defines a 5-morphism of local 
formal schemes 
Fda a x tor > ALfor. 


cf. [B05], p. 126. 
The main result of this paper is the following: 


Theorem 1. Let X be a smooth projective curve of genus at least 2 over R and let 
f : X — A! be a 5-morphism vanishing at some point P° € X(R). The following 
are equivalent: 


(1) f is a6é-character centered at P°; 

(2) f'® is a 6-character; 

(3) Ym f' is 5-symmetric for all m > 2; 
(4) Sf is 6-symmetric. 


The implications 1 = 2 and 3 => 4 are clear. The rest of this paper is devoted 
to the proof of 2 > 3 and 4 > 1. Before proceeding, let us remark that conditions 
2, 3 and 4 in the Theorem look a priori much weaker than condition 1; indeed, 
the R-module of 5-characters of order r on X" has infinite rank whereas the R- 
module of 5-characters on X centered at P° has finite rank. What morally makes 
the implication 4 = 1 hold is the additional assumption that f is a global object, 
defined on the whole of X. 
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3 Proof of the theorem 


The next lemma settles the implication 2 = 3 in Theorem 5: 


Lemma 1. Let f!" : X' — A!S°" be a $-character where X* has relative 
dimension 1. Then S\y, f © is 5-symmetric for all m > 2. 


Proof. Assume we are in the situation of Definition 2 with 


X™ = Spf Rill]. 
G'" = Spf R{IT]]. 


AN = Spf Rilzll. 
Here T is a g-tuple of variables. Let the morphism B' : X" + G' be given by 
pT = Ot) € Rife] 
and let w®* : Git + G'* be given by 
pe: = w= wweT,...,T”) € RIT,..., TJ]. 
Let F,, € RI[T1,..., Tn ]]® be the tuple of series defining the m-fold addition 
(Geryn 5 Gir, 
Then (by a universality property argument) the tuple of series 
(Fin, 5Fm,..-,8" Fim) 
defines the m-fold addition 
(7(Gyy" > I™(G™). 


Recall the classical fundamental theorem of symmetric polynomials saying that 
the symmetric polynomials in R[t1,..., tn] belong to the ring R[s1,..., Sm]. By this 
theorem applied to the homogeneous components of 


G = Fm(P(t1), ne ay P(tm)), 


we have G € R[[s1,...,5m]]&. Then, using the fact that YW defines homomorphism, 
we get 


> W (P(t) gees 5’ P(t; )) = WY (Fim (P(t) geese P(tn )) geees 8” Fin (P(t) a) P(tm))) 


i=1 
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Lemma 2. Let Y be a smooth scheme over R, with irreducible fibers, and let 
U CY bean open set. Let Yo := Y @k, Up := U ®k, and assume Yo\Up 
has codimension > 2 in Yo. Then O' (Y) = O'(U) for all r > 0. 


Proof. Forr = 0,O0°(Y) = O(Y), where * means p-adic completion, and the result 
is well known. (For convenience we recall the argument. We may assume Y is affine. 
Take f € O(U). Then the image f € O(Up) extends uniquely to some g; € O(Yo). 

Since Y is affine g; is the image of some g) € O(Y). So f=eaitph.fi€ OU). 

Similarly one finds go € O(Y) and he O(U) such that ‘as = g2+ pfo. Continuing 
we get, in a similar way, f = gi + pg. +...+ a &n + Pp" fn» Bi OY), 

hé O(U). Clearly the restriction of }> p’~ la € OY) to U is J.) Now assume r 
arbitrary and recall that O"(Y) = O(J"(Y)), where J" (Y) is the p-adic completion 
of some smooth scheme. So it is enough to check that J’(U) ® k has codimension 
> 2in J’(Y)@k. This follows immediately from the fact that J” (Y) ®k is a locally 
trivial bundle over Y @k with fibers affine spaces and J” (U) @k is the inverse image 
of U @k in J"(Y) @k; cf. [B05], Proposition 3.13 and Corollary 3.16. 


Lemma 3. Let X be a smooth projective curve over R, let X" be the n-fold product 
of X (n > 2), let S, x X" — X" be the natural action of the symmetric group, 
and let X" = X"/S, be the n-fold symmetric product of X. Let 1 = T : 
X" — X) be the natural projection. Let f € O"(X) an let P® € X(R) be 
such that f(P°) = 0. Let f" € O" (X") be defined by 


SF" (Pi,.-+, Pn) = fCP1) +++ + f (Pn), Pi € X(R). 


Assume 5» f® is 5-symmetric. Then f" descends to a function f™ € O'(X™) 


(ie, f? = f™ ax), 


Proof. View the k-points of X @) as effective divisors on Xo := X @k. Let H C 
X) be the image of the union H = UX; ; Of all “principal diagonals” Xj; C X"; 
so, fori < j, Xj; is the image of 


xr! — xX", 
(Piss f Pa Pieces Fiat Pp, Pippin Pps Pi Pyfigees Pp): 


Then Hp := H @ k is the image of re hence is an irreducible divisor on x 
The projection 
X"\H > X\H 

is a finite Galois étale morphism so by [B05], Proposition 3.27, f” descends to a 
function f” € O' (XH). Choose a reduced divisor Do on Xo of degree n — 2 
whose support does not contain P,’ and consider the k-point 2P) + Do € Ho(k). 
Claim 1. There exists a neighborhood U of2P; + Do € X™(k) in X™ such that 
J” descends to a function rs e€ OU). 

Note that Claim 1 can be used to end the proof of our lemma. Indeed, 
fi ™ © O'(X\H) and db “ € O'(U) obviously coincide on the R-points of 
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Un (XH) so they yield a function ie € O'(U U (X™\H)). Since Ho 
is an irreducible divisor on X(””, the complement of Up U (X(”)\ Ho) in X”” has 
codimension > 2. By Lemma 2, il extends to a function f) € O"(X), Since 
fon = f” onan open set with non-empty fibers over R it follows that the latter 
equality holds on the whole of X”. 

We will now prove Claim 1. 

Choose an affine neighborhood U of IF + Do in X™ such that the divisor 
H NU is given in U by one equation h € O(U). By shrinking U we may assume 
that U has étale coordinates 


815825 [35 000g tas 


where s; = t) + fb, S2 = tt such that f), tf come from an étale coordinate ¢ on 
X around Pe and f3,...,¢, come from étale coordinates on X around the points in 
the support of Do. Let A = O(U) and denote by o the variables s), 52 and by t the 
variables t3,...,t,. (By the way, we may assume the image of h in R[[o, t]] equals 
5S} — 452; we will not need this explicit expression for the image of h.) By [B05], 
Proposition 3.13, we have identifications 


O'(U) = Afo’,1’,...,0, YP, (5) 


O"(U\H) = (AnY[o'=,0’,...,6, cP. (6) 


We may view qe as an element in the ring (6). On the other hand, by our 


assumption that ¥> f ®" is 6-symmetric, we may view f,” 


as an element in the ring 
Rio-no'.2...5,07, 2], (7) 

The rings (5), (6), (7) are subrings of the ring 
(Rilo. Ha) Ilo’. t’,..-,. 06, e 7]. (8) 


So Claim I will be proved if we prove 


Claim 2. The intersection of the rings (6) and (7) inside the ring (8) equals the 
ring (5). 


In order to prove Claim 2 we prove a series of other claims. Let 
h:=h@1é€A:=AQk. 
Claim 3. k{[o,t]] N A; = A. 


To check the non-obvious inclusion “C” consider the maximal ideal M := 
(o,t) C A. Then the M-adic completion of Ay is k[[o,t]] so the extension 
Ay C k[[o, t]] is faithfully flat. So if Q(A) is the fraction field of A, then 


k[[o, tI] Q(A) = Ag. 
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In particular 

k[[o, t]] 9 A; C Ay M A; = A, 
which ends the proof of Claim 3. (The last equality holds because any element 
in Ay 9 A; defines a rational function u on Spec A = Uy = U ® k which is 
regular at the point M and also outside the hypersurface Hy M Uo; since Hy N Uo 
is irreducible, u is regular outside a closed subset of codimension > 2, hence is 
regular on the whole of Uo.) 
Claim 4. R{[o, t]] N (Any = A. 

The non-obvious inclusion is, again, “Cc”. Now if u € R[[o, t]] N (A;,)’, then its 
reduction mod p satisfies u € k[[o, t]] N A;. By Claim 3 there exists aj € A such 
that u— a, € p(A,)/N pRifo, t]] sou — a, = wy with uw, € R{fo, t]] M (An). 
Repeating the procedure we find, for any , that 


n—1 


u=a,+ pazt+:::+p Gn + p"Un, 


where a; € A, u, € R{[o, t]] N (An) which clearly implies u € A. 
Let us prove Claim 2. Any element F in the intersection of the rings (6) and (7) 
is an element in the ring 


(Rilo, t]] 9 (An) Ilo’, 7’... 06, eT. (9) 
By Claim 4 the ring (9) equals the ring 
Allo’. t’,...,0, J]. (10) 


So F is in the intersection of the ring (10) with (6), hence is a series in 
o’,t’,...,06%, c with coefficients in A which tend to 0 in the ring (Ay). Since 
A> a4 is injective, p(An) NM A= pA, so p" (Any A A= pA for any n. So the 
coefficients of F tend to 0 in A and hence F is in the ring (5) which ends our proof. 


Lemma 4. Let Y be a smooth scheme over R, E a locally free sheaf on Y, and 


Py (E) the associated projective bundle over Y . Then, for all r = 0, the natural map 
O"(Y) > O'(Py (E)) is an isomorphism. 


Proof. We may assume € is trivial so Py(€) = Y x P’. If Y = Spec R then our 
assertion was proved in [B96]. For arbitrary Y pick a point 9° € P’(R) and let 
f € O'(Y x PY). Denote by s : Y > Y x P% the map s(P) = (P, O°). If 
ma: Y xP -— Y is the first projection, then 


(foson)(P,Q) = f(P,Q°) = f(P, Q) 


because the map 
P'(R) > R, Or f(P.Q) 


is constant (by the case Y = Spec R of the lemma). We just proved surjectivity of 
the map O'(Y) + O' (Py (E)). Injectivity is obvious. 


We are ready to prove the implication 4 = 1 in Theorem 5. 
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Proof. Assume condition 4 holds. For n > 1 let B, : X™ -» A = Jac(X) be 
the Abel-Jacobi map which, at the level of geometric points, sends a divisor D of 
degree n into (the isomorphism class of) O(D — nP°). By Lemma 3 the induced 
6-morphism f” : X" — A! descends to a 6-morphism f“) : X¥ — Al!, Recall 
that for any n > 2g — 1 (where g is the genus of X) we have X"” = P4(E,) 
for some locally free sheaf €, on A. By Lemma 4, f descends to a 6-morphism 
Wn: A— Al. If B = B, : X > A, then for any P € X(R), 


(Wn 0 B)(P) = Yn(O(P — P°)) 
= Wn(O((n — 1) P° + P —nP®)) 
= f(n-1)P° + P) 
= (n- 1 f(P) + f(P) 


Sir): 
Sow,oB = f. 


We claim that y, is a d-character and this will end our proof. Fix the integer 
n > 2g — 1 in what follows. Recall that B, : X (s) _5 A induces an isomorphism 
B,'U ) — U where U C A is some open set with non-empty fibers over R. So 
there is an open set V C U with non-empty fibers over R such that if rz, : X° > 
X‘8) is the canonical projection then Vix m,'B,'(V) — V is étale. So any R- 
point € of V lifts to an R-point of X%, i.e., there exist P},..., P, € X(R) such 
that Be(P; +--+ + Py) = &. Hence, if De := Pi +--+ + Py + (n — g)P°, then 
Bn(Deg) = &. Now let &|, & € V(R) be such that & + & € V(R). Then 


(De, —nP°) + (Dz, —nP°) ~ Deze, —nP®, 


where ~ means linear equivalence on the geometric generic fiber of X.So De, + De, 
and Dz,+¢, +nP° correspond to R-points of X°”) that map to the same R-point of 
A via B2,. Hence we have: 


Vin (E1) + Wn(E) = f™ (De) + f (De) 
= fO") (De, + De) 
= f°) (Dere +nP®) 


= f(Di+_) + nf(P) 


= fDyto) 


= Wn(—i + &). 
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The above identity holds for all R-points of the open set (V x V) M w~!(V) where 
fu: Ax A => Ais the addition hence it holds on A x A hence w,, is a 6-character 
and our claim is proved. 
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Weyl group multiple Dirichlet series of type A, 


Gautam Chinta and Paul E. Gunnells 


In memory of Serge Lang 


Abstract A Weyl group multiple Dirichlet series is a Dirichlet series in several 
complex variables attached to a root system ®. The number of variables equals the 
rank r of the root system, and the series satisfies a group of functional equations 
isomorphic to the Weyl group W of ®. In this paper we construct a Weyl group 
multiple Dirichlet series over the rational function field using n’” order Gauss sums 
attached to the root system of type Az. The basic technique is that of [10, 11]; 
namely, we construct a rational function in r variables invariant under a certain 
action of W, and use this to build a “local factor” of the global series. 


Key words Multiple Dirichlet series * Eisenstein series 
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1 Introduction 


Weyl group multiple Dirichlet series are Dirichlet series in r complex variables 
51, 52,...,8, that have analytic continuation to C’, satisfy a group of functional 
equations isomorphic to the Weyl group of a finite root system of rank r, and 
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whose coefficients are products of n'" order Gauss sums. The study of these series 


was introduced in [2], which also suggested a method for proving their analytic 
continuation and functional equations. 

Recently a complete proof of these expected properties has been given in [12]. 
In this paper we describe in detail the construction for the root system Az. There 
exist alternate constructions of the series defined here. For A, andn > 2 one falls in 
the stable range, and therefore our result follows from the work of [3]. (In fact, this 
case was treated earlier in [2].) Nevertheless there are several reasons why a new 
treatment of Az is desirable. First, the methods used here are completely different 
from those of [3] and give an alternative technique to construct Weyl group multiple 
Dirichlet series. Second, the technique presented here works for a root system ® of 
arbitrary rank and for arbitrary 7, with no stability restriction. This is the subject of 
[12]; one of the main goals of the present paper is an exposition of our method in 
the simplest nontrivial case, namely ® = Ap. 

With this latter goal in mind we also adopt certain assumptions to make the 
exposition simpler. For instance, we work over a rational function field to avoid the 
annoyance of having to deal with Hilbert symbols. We also focus on the untwisted 
case (see §2 for an explanation of this terminology) to avoid some notational 
complexities. A comparison with the methods of [2, 10, 11] indicates how to extend 
our methods to an arbitrary global field containing the 2n' roots of unity and to 
arbitrary twists. 

We now describe our main result in greater detail. Let F be a finite field whose 
cardinality g is congruent to 1 mod 4n. Let K be the rational function field F(t), 
and let O = Ft]. Let Omon C O be the subset of monic polynomials. We let 
Koo = F[t~'] denote the field of Laurent series in ¢~!. 


For x, y € O relatively prime, we denote by (:) the n’" order power residue 


symbol. We have the reciprocity law 


()=( a 


for x, y monic. The reciprocity law takes this particularly simple form because of 
our assumption that the cardinality of F is congruent to 1 mod 4. 

Let y + e(y) be an additive character on Kg with the following property: if 
I C K is the set of all y € K such that the restriction of e to yO is trivial, then 
I = O. Fix an embedding € from the the n'” roots of unity in F to C*. For r,c € O 
we define the Gauss sum g(r, €,c) by 


reeo= D e((2))e(2). 


y mod c 


We will also use the notation g;(r,c) = g(r,e',c) and g(r,c) = g(r,€,c). Note 
that €’ is not necessarily an embedding. 
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We are now ready to define our double Dirichlet series. Put 


Z(s1, 82) = (1 _ gr _ amas wae Ol ra ais aie 


(ci, C2) 
1.2 
. » > Jeq | |e)” ( ) 


€1€O mon C2€Omon 


where the coefficient H(c;, c2) is defined as follows: 


(1) (Twisted multiplicativity) If gcd(c1c2, djd2) = 1, then 
H(cidi,¢2dy) d en 
ie naa) aa) aa) a) (2) (3) 


(2) (p-part) If p is prime, then 


> A(p’. p')x*y! = 14 g(.p)x + gL ply + g(L pgp, p)xy 


k1>=0 
+ g(l.p)g(p.p-)xy* + g(1. p)g(p. p)x7y 
+ g(1.p)e(p. p-)x7y”. (1.4) 


Our main result is 


Theorem 1.1. The double Dirichlet series Z(s,, 52) converges absolutely for Re(s;) 
sufficiently large and has an analytic continuation to all (s,, 52) € C?. Moreover, 
Z(51, 82) satisfies two functional equations of the form 


01: (81,52) H (2 — 51, 8; + 52 — 1) and 02: (51, 52) + (8) + 52 —1,2— 52). (1.5) 


These two functional equations generate a subgroup of the affine transformations of 
C? isomorphic to the symmetric group S83. 


The precise statement of the functional equations involves a set of double 
Dirichlet series Z(s), 52:7, 7), where 0 < i, 7 < n—1, and where Z(s1,52) = 
yi. j Z(S1, 5237, 7); we refer to Theorem 4.1 for details. Moreover, one can explic- 
itly write down Z(s,, 52) as a rational function in g~*'!,q~*?. For n = 2, this was 
first done by Hoffstein and Rosen [16] (in the case of the rational function field with 
q congruent to | mod 4), and later by Fisher and Friedberg [13] (over a general base 
field). For n > 2, again working over the rational function field, the Az series have 
been computed by Chinta [8]. 

As stated above this theorem follows from the work of [2,3]. In [6], the authors 
study the harder problem of constructing twisted Weyl group multiple Dirichlet 
series associated to the root system A,. They construct such series for Az and 
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present a conjectural description of the series associated to A, for arbitrary r and n. 
Recently, Brubaker, Bump and Friedberg have given two different proofs of their 
conjectures [4,5], thereby giving a complete definition of Weyl group multiple 
Dirichlet series associated to A,. In fact, in [4] the authors prove that the series they 
construct are Fourier-Whittaker coefficients of Eisenstein series on a metaplectic 
n-fold cover of GL,4+1, thereby establishing Conjecture 1.4 of [3], in the case 
G = GL,+,. Consequently, these Weyl group multiple Dirichlet series inherit from 
the Eisenstein series functional equations and analytic continuation. Contrastingly, 
the techniques used in [5] make no use of the connection to Eisenstein series other 
than in the rank | case. 

This is also the case in our paper. Our method has the advantage that functional 
equations are essentially built-in to our definition. As in the case of [2,3, 6, 10, 11] 
the Wey] group multiple Dirichlet series are completely determined by their p-parts 
and the twisted multiplicativity satisfied by the coefficients. Our approach is to show 
that if the p-parts (which can be expressed as rational functions in the |p|~*') satisfy 
certain functional equations, then the global multiple Dirichlet series satisfies the 
requisite global functional equations. This leads us to define a certain action of W, 
the Weyl group of the root system ®, on a certain subring of the field of rational 
functions in r indeterminates. This approach, first introduced in [7], has been carried 
out in the quadratic case for an arbitrary simply-laced root system, see [10, 11]. 
We extend this approach to arbitrary ® and n in [12]. However, although the basic 
ideas are clear, the non-obvious group action required on rational functions can 
appear unmotivated and complicated in the general setting. Therefore, we feel it is 
worthwhile in this paper to work out in detail the simplest nontrivial case, the rank 
two root system Ap. 

Here is a short plan of the paper. Section 2 describes the Weyl group action on 
rational functions that leads to a p-part (1.4) with the desired functional equations. 
Although the focus of this paper is untwisted Az, we work more generally at 
first and state the full action for a general (simply laced) root system. We then 
specialize to untwisted Az. Section 3 reviews the Dirichlet series of Kubota; in the 
current framework, these series are Weyl group multiple Dirichlet series attached 
to A,. The main result of this section is Theorem 3.4, which shows that a certain 
Dirichlet series E(s,m) built from the function H(c,d) from (1.3)-(1.4) satisfies 
the same functional equations as Kubota’s. Finally, in Section 4 we use Theorem 
3.4 to complete the proof of Theorem 1.1. The basic idea is that the (one variable) 
functional equations of the E(s,m) induce a bivariate functional equation in the 
double Dirichlet series. 
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2 A Weyl group action 


Let ® be an irreducible simply laced root system of rank r with Weyl group W. 
Choose an ordering of the roots and let ® = ©+ U ®~ be the decomposition into 
positive and negative roots. Let 


A = {01,Q2,...,a,} 


be the set of simple roots and let o; be the Weyl group element corresponding to 
the reflection through the hyperplane perpendicular to a;. We say that 7 and 7 are 
adjacent if i # j and (o;0;)° = 1. The Weyl group W is generated by the simple 
reflections 0), 02,...,0,, which satisfy the relations 


3 ifi and j are adjacent, 
(o;0;)'")) = 1 with ri, 7) = 4 1 ifi = j, and (2.1) 
2 otherwise, 


for 1 <i, 7 <r. The action of the generators 0; on the roots is 


a; +a; ifi and 7 are adjacent, 


Oj; = ) —; ifi = j, and (2.2) 
aj otherwise. 
Define 
sgn(w) = pete 
where the length function on W is with respect to the generators 0), 02,...,0,. Let 


A be the lattice generated by the roots. Any aw € A has a unique representation as 
an integral linear combination of the simple roots: 


a= kyo, + koa. +--+ kay. (2.3) 


We denote by 
d(a) =k +ky+--+k, 


the usual height function on A and put 


dj(a) = ye, 


inj 


where i ~ j means that the nodes labeled by 7 and j are adjacent in the Dynkin 
diagram of ®. Introduce a partial ordering on A by defining a > Oif each k; > 0 in 
(2.3). Given a, B € A, definea > B ifa— 6 > 0. 

Let A = C[A] be the ring of Laurent polynomials on the lattice A. Hence A 
consists of all expressions of the form f = >> BeA agx?, where ag € C and almost 
all are zero, and the multiplication of monomials is defined by addition in A: x?x* = 
xP+4_ We identify A with C[x1,x7!,...,x-,x,'] via x” 1 x;. 


r 
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Let p be a prime in © of norm p. Let A be the localization of A at the 
multiplicative subset of all expressions of the form 


{1 _ git) ge) 1- peng) | ae or, 


The group W will act on A, and the action will involve the Gauss sums gi(1,p).! 
There is one further parameter necessary for the definition. Let £ = (/),...,/,) be 
an r-tuple of nonnegative integers. The tuple @ is called a twisting parameter; it 
should be thought of as corresponding to the weight }°(/; + 1)w,;, where the w; 
are the fundamental weights of ®. The case £ = (0,...,0) is called the untwisted 
case. For each choice of £ we will define an action of the Weyl group W on A. 

We are now ready to define the W -action. First, we define a “change of variables” 


action on A as follows. for x = (X|,X2,...,X,) define o;x = x’, where 
PXiX; ifi and 7 are adjacent, 
x, = 4 1/(p?x;) ifi = j,and (2.4) 
xj otherwise. 


One can easily check that if fg (x) = x is a monomial, then 
poms Per’, (2.5) 
Next, write f € Aas 
f(x) = > apx?. 
B 


Given integers k,i, 7, define 


Asi, j) = > apx?. 
By. =i mod n 
d;,.(B)=j mod n 


We define the action of a generator 0, € W on f as follows: 


n—-1 n-1 
(f leo) (®) = (xn) S05 (Pi (Xt) fen i, 7 — Ie) 
i=0 j=0 
+ Qi (xk) ficlonx; j +1-i, j —1)), (2.6) 


'We remark that our normalization for Gauss sums follows [3,6] and not [10,11]. See [11, Remark 
3.12] for a discussion of this. 
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where 
ee 1-1 
ij(x) = (pry +e ATL 
1 = p'—!xn 
= * 1 l—n LS p'x" 
Qij(x) = —87;-;-1 1, p) (Px) T— pr lxn’ 
‘ si(l.p)/p ifn ti, 
g; (,p) = . 
—1 otherwise. 
Here (7), € {0,..., n — 1} is the remainder upon division of i by n. We extend 


this action to all of A first extending (2.6) to all of A by linearity, and then given 
f/g € A by defining 


(4| a, ) () = (Ff |eoK) (x) 
gle* ~ g(OKX) 


One can show that this action of the generators extends to an action of W on A: in 
particular the defining relations (2.1) are satisfied. 

Now we specialize to the focus of this paper: we set ® = A» and £ = (0,0). To 
simplify notation we write x, y for the variables of ‘A. With these simplifications the 
action of 0; on f € A takes the form 


n—-1n-1 


(flov.») = Oo (Pi OA (Fp ery: /) 


i=0 j=0 
+ Qy@)A(ge pr +1-‘))i 2D 
the action of o> is similar. An invariant rational function for this action is 


N(x, y) 


h ; = ’ 
(x y) (i = p'!x")(1 _ pr yry — p'n-lynyr) 


(2.8) 


where the numerator N(x, y) is 
N(x, y) = N(x, y) = 1+ gi(l,p)x + gi p)y + ai(, pgi(p, p)xy 


+ pgi(l, p)go(1.p)xy? + pei, p)go(1, p)x7y 
+ pgi(l,p)ga(1, p)x7y’. (2.9) 
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To compare this with (1.4), note that pg2(1, p) = gi(p, p’). Also note that only the 
numerator of (2.8) appears in (1.4) because the denominator is incorporated in the 
factors appearing at the front of (1.2). 

Let us write h(x, y) as 


(2.10) h(x,y) = D> a(p*, p')x*y’ 


k1>=0 


n—1 
-O9'(E LY aot ohs'] 
1>0 1=0 k=i modn 

n—-1 
= 2 » y'h® (x; i). 


1>0 i=0 


The following two lemmas are proved by a direct computation. 


Lemma 2.1. We have N)(x,0) = 1+ g) (1, p)x, N®(0, y) = 14+ 91(1, p)y and 
for j =I modn, and0 <i <n—1, 


ne xsi) = (px)! Py h (si) + (p)' Oy (A (Geel + 1= i), 
Lemma 2.2. Let 
fPIC: i)= hOO (x: j= 6g9;—-1-1(1, p) pei! x AFI Dn JOD (x, | de =F) 


where 6 = 0 if! —2i = —1 mod n and is | otherwise. Then 


fC) = Ga Teh Fs). 


px 


3 Kubota’s Dirichlet series 


The basic building blocks of the multiple Dirichlet series are the Kubota Dirichlet 
series constructed from Gauss sums [17, 18]. Let m be a nonzero polynomial in O 
and let s be a complex variable. These series are defined by 


(3.1) D(s,m) = (1— gry ye ne 
d€Omon | | 
and 
(3.2) D(s,m;i) =(1-—q™™™5)"! > — 
deg d=i mod n | | 


d€Omon 
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Kubota proved that these series have meromorphic continuation to s € C with 
possible poles only at s = 1 + 1/n and satisfy a functional equation. Actually, 
Kubota worked over a number field, but the constructions over a function field are 
identical. 

If the degree of m isnk + j, where 0 < j <n-—1, this functional equation takes 
the form 


(3.3) D(s,m) =|m|'*  S° j(s)D(2—s,m;i), 


0<i<n-1 


where the 7;;(s) are certain quotients of Dirichlet polynomials. For fixed s the 
T;; depend only on 2i — j. We will not need to know anything more about the 
functional equation, but a more explicit description can be found in Hoffstein [15] 
or Patterson [20]. 

Given a set of primes S, we define 


(3.4) Ds(s,m) = (1- gts) ye g(m,d) 


(d.S)=1 |d| 
d€Omon 


If mp = ees p we sometimes write Dj, (s,m) for Ds(s,m). 
We record some properties of Gauss sums that we will use repeatedly. 
Proposition 3.1. Leta,m,c,c' € O. 


(i) If (a,c) = 1, then g;(am,c) = (2) g0n.0). 
(ii) If (c,c’) = 1, then 


Cc 21 
gi(m,cc’) = gi(n,c)gi(m.c’)(<) 


Using this proposition we can relate the functions Ds to the functions Ds, for 
different sets S and S’. This is the content of the following two lemmas. 


Lemma 3.2. Let p € Omon be prime of norm p. For an integer i withO <i <n-1 
and m,,™z,p all pairwise relatively prime, we have 


iy, Blanzp', p'*?) wie 
Dm, (s, mop ) = Dym, (Ss, mop ) Ss ~ ptttns Dem s, mp" )n . 


More generally, 


( p n—i-2)n 
bom = ¥ (11 tr Jos(s. TL» TT). 


SocS \peSo pes peSo 
p! ||m p' ||m 
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Proof. We prove only the first part of the lemma. For p, 1, m2 as in the statement, 


m2p',d 
a- go "Dm (S, mop ) = = » g( etme’ sd) ) 
(d,m,)=1 |d| 
d€Omon 
-) SE g(mrp', dp*) 
Als pks 
k>=0 (dm, p)=1 |d|s Pp 
dEOmon 
es » x pve DEE) d ) 
7 S yks k 
k>0 (dymip)=1 |d| Pp p? 
E mon 
= > : is : 
(d,mp)=1 ~ dF k>0 a ) 
d€Omon 


The Gauss sum in the inner sum vanishes unless k = 0 ori + 1. This proves the 
lemma. al 


Inverting the previous lemma, we obtain 


Lemma 3.3. [f0 < gi <n —2andm,,m2,p as above, 


Dy, (s, mp") _ g(mop' , we) Dye (s, mop"—*-?) 


Dym, (s, mop') = 


[= |pjz—l—"s |p| @+Ds t= |p|z—l-ns 
and ifi =n—1, 
i Dn (s, mp") 
Dym,(S,m2p') = T= [pps 


Now suppose that N(x, y) = N)(x, y) is the polynomial from (2.9). We define 
a function H on pairs of powers of p by setting H(p*, p’) to be the coefficient of 
xky! in N(x, y): 


N(x, y) = >) A(p*, p')x*y! 


We extend H to all pairs of monic polynomials by the twisted multiplicativity 
relation: if gcd(cd, c’d') = 1, then we put 


G5) Hec.dd’) = He.ane.ay(5) (ZY (5) (2). 


In particular, note that 


(3.6) H(d,1) = g(l,d). 
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Now consider the Dirichlet series 


(3.7) E(s,m) = (1 = | es i > H(d,m) 


ds 
ZEO won 


That E(s,m) satisfies the same functional equation as D(s, m) is the main result of 
this section. 


Theorem 3.4. Let m € Omon be a monic polynomial of degree nk + j, where 
O0< j <n-1. Then 


E(s,m) = |m|!$ Tj (s) E(2 — s,m; i). 


0<i<n-1 


Proof. Before tackling the general case, we first consider m = p! for a prime p and 
1 > 0. Then 


E(s,p') = (= gy > so H(dp* Pp ‘) 


ds ks 
d€Omon k=O Ip! 
(d,p)=1 


H(p*,p')e(1,d); d 
=(I-g™)' YD ae (Sz): by G.5) and 3.6) 


d€Omon k>0 
(d,p)=1 
= 3 HOP )p ( pln) 
rere 
n—-1 Z 
a 1 Api) 
= > Pols, p" a) js PD nks 
P= rer 


n—-1 


= D7 Dols, pO yh Upl J) 


j=0 


where h'?-”) was introduced in (2.10). Using Lemma 3.3 the previous expression 
becomes 


n—-1| 


de Ds, pO ya Ups A) 


j=0 


a) I-2f)n pU—2/)n +l 
gpl As pO ae) ips one 
ga ~ 28; 1p | C=2i Fs DG per ne ble); 
= 
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where 6; = Oif / —27 =n — 1(n) and is | otherwise. Replace j by / + 1—j in 
the second summation and regroup to conclude 


n—-1 


(3.9) E(s.p') = S> D(s, pl") FO (pl-s: f) 


j=0 


(Note the use of the identity n —2—(/—27), = (27 —1 —2),,.) Using the functional 
equations (3.3) of D and f-) (Lemma 2.2), we write 


E(s,p')|p|[-O-™ 
n—1 n-1 


= D5 > Tiga), egy (9) D2 — 5, p 2; 1) FOO (2 — 5; 7) 


j=0 i=0 


n—1 
= > Ti-; degp,c—2/), degn (8) DQ — 5, p97"; 1 — j degp) f° (2-5; j) 


i,j=0 


n—-1 n—-1| 


= D5 Tisaegp(s) | >> DQ—s, p07"; i — j degp) f°? 2—s; j) 


i=0 j=0 
n—-1 

(3.10) = >> Tirdegp(s)E(2—s,p'3 i), 
i=0 


where the third equality comes from our remark that the T;; depend only on 2i — j. 
This is the functional equation we wished to prove, in the special case m = p!. 

The argument for general m is similar. Let m = pipe veep where the p; are 
distinct primes and the /; are positive. Then 


H(d,m) 


E(s:m) = ( gees be ‘di 


dé Omon 


k - 1 : 
=(1- qr)! be > H(dp;! +++ pk, pil s+ pir) 
dEOmon ke iin k,>0 |d|5|py [Fs «+= |p, Ars 
(d,m)= 


ae H(d, 1)H(p"', p)--- Hp", p!) 
ap ae ee 


d&O mon Ry sere > 
(d,m)= 


= ka la kan = 
) Go) TCG 


1B gp Pp P, 
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n—-1l n—-1 


=T1(4 ‘) SS y Din (s, pt —2f)n , - plein) 


a#b Pp A=0  jr=0 


Ja Ja 


(3.11) xy! Ns: yes p@rlr Ns; iv) Pa Pa \— 
TG) Gp) 


Denote for the moment by C(j1) = C(j1,..., j-) the product of residue symbols 


Ja Ja 


i pa pa \— 
(3.12) cw = T(E) an ; 
a#b fees 
Letting J; = (/; — 2/;), fori = 1,...,7, we have 
(3.13) 
(1 = [pil *™)Din(s, py! ++ PE )CCD) = Dopp, (8, pt! + pZ)CGD 
(pi . ++ pir Fale Qji- 1 -2)n 


on [pi] tbs Doyy-p, (8, Pj py pe CU) 


by Lemma 3.3. In the second term on the right-hand side, replace j; by /; + 1— ji. 
For 5;, 4 0 this gives 


(3.14) 


Qji-h —2)n 


271—-h-1)n 
g(ps Qji-h- dn) 


J 
po +: p . 
cree ma Dorp, (8, PUPS PEC = fr + VD. 


The Gauss sum can be written as 


(3.15) (aa a genial, 


21-1 
Py 


and C(/, — j; + 1) is 


Pi is Hi gets 2 aes ja. — 
a Gy Cy (Ty) 
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Taking the product of (3.15) and (3.16) yields 


(3.17) 
a) 
heen : ’ 


j;—l ~e)n ji—l ~1)n ‘ 
Seg a CG. 
Therefore, continuing from the last line of (3.11), 


(3.18) 


n—-1 


L n-1 
Pa N—2iv)n 2}, 
E(s,m) = TT (T) So Yo Danls, py pylon) 


a#b 6b ji=0 jr=0 


pa peal (Pili) (e+ 7.) P2l2)(e* 7 Orddees 3 
T(E) (Be) re aM) HOM) 
a#b b b 


where m! = pe tee pr. Repeating this procedure to remove the primes from m one 
at a time, we find that up to a constant of modulus one, E(s,m) is equal to 


a 


Ja 


n—-1 n—-1 r pi p 
(3.19) So SO DG, py!) (1 rj) MS) 


lp 
ji=0 jr=0 a=1 a#b *b P, 


We may now apply the functional equations of D and the f««) as in (3.10) to 
conclude that E'(s,m) satisfies the functional equation 


n—1 


(3.20) E(s,m) = |m|!* Y"Ti.degm(s)E(2 — s,m; i). 
i=0 
This completes the proof of the theorem. oO 


For later use, we record the following bound: 


Proposition 3.5 For alle >0,m € Oand0 <i <n, 


1 for Re(s) > ; +e 
(s—1—Z)(s—-14 Z)EG, mi) Ke 4 |mj2+© for S—e<Re(s)<ite 
|m|'S*© for Re(s) < $e 
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Proof. Use the meromorphy and functional equation of E(s,m) together with the 
convexity principle, cf. [14, Eq. (2.3)] and [19, Propostion 8.4]. oO 


4 The double dirichlet series 


Recall the definition of the double Dirichlet series from (1.2)—(1.4). In this section 
we show that Z(s;, 52) has a meromorphic continuation to s;,52 € C and satisfies 
a group of functional equation isomorphic to W. In [2], the authors show in detail 
how the analytic continuation of a Weyl group multiple Dirichlet series follows from 
the functional equations. Therefore we concentrate on establishing the functional 
equations of Z(s), 52). 

Actually we need to consider slightly different series. For integers 0 < i,j < 
n — | we define 


(4.1) Z(s1, 5231, J) = ad —q" mst) la —q" ms2) q ge ey 


se. 2 ae 
m€Omon d€Omon jm i Id P 


degm=i mod n degd=j mod n 


We further introduce the notation 
Z(s1,8231,*) =) Z(s1, 92:4, f) 
J 
and 


Z(s1,523*, J) = Y > Z(s1, 523i, j).- 


These series are absolutely convergent for Re(s;), Re(s2) > 3/2. In fact, we can do 
a little better. Summing over d first yields 


(4.2) 
Z(s1, 823i, *) = ( —q" 1a ag" m2) id =f" 


1 H(d,m) 
«line 2nd 


m€Owmon d€Omon 
deg m=i mod n 
= (1 _ go) — gy y E(s2, m) 


m|*! 
m€Omon | | 
deg m=i mod n 
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By the convexity bound of Proposition 3.5, this representation of Z(s,, 5237, *) is 
seen to be meromorphic for Re(s;) > 0, Re(s2) > 2. Alternatively, summing over m 
first we deduce that Z(s,, 52:7, *) is meromorphic for Re(s2) > 0, Re(s,;) > 2. Let 
R be the tube domain that is the union of these three regions of initial meromorphy: 


R = {Re(s,), Re(sz) > 3/2} U {Re(s,) > 0, Re(s2) > 2} 
U {Re(s2) > 0, Re(s}) > 2}. 


Let the Weyl group W act on C? by 
(4.3) oy : (81, 52) (2-81, 51 +52—1), 02: (51, 82) (81 +52 —-1,2—57). 
Let F be the real points of a closed fundamental domain for the action of W on C?: 


F = {Re(s;), Re(sz) = 1}. 


One can easily see that R ~ F M FR is compact. Therefore, by the principle of 
analytic continuation and Bochner’s tube theorem [1], to prove that Z(s,, 52) has a 
meromorphic continuation to C? it suffices to show that the functions Z(s1, S237, j}) 
satisfy functional equations as (5), 52) goes to (2 — 51,5; + 82 — 1) and (s; + 52 — 
1,2 — s2). For details, we refer to [2, Section 3]. 

To prove the o2 functional equation, we begin with (4.2) and write 


E(so, 
Z(s1, 82; i, *) = (1 ee gy = gion a (s2 m) 


m|*! 
m€Omon | | 
deg m=i mod n 


= (1 _ gg =g ome 
n—-1l 


1—s2 
x > ma Tji(s2)E(2— s,m: j), by Thm. 3.4 
j=0 


m€Omon 
deg m=i mod n 


n—-1 
= So Tjils2)Z(s1 + 82 —1,2— 523i, j) 
j=0 


The o; functional equation is proved similarly. 
We conclude that 


Theorem 4.1. The double Dirichlet series has a meromorphic continuation to 
51,52 € Cand is holomorphic away from the hyperplanes 


1 1 1 
s,s =lt—-,s =lt—ands,;+5.=224-. 
n n n 
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Furthermore, Z(s1, 82) satisfies the functional equations 


R 


1 


2, 


Z(s1,52) = » Tji(82)Z(s1 + $2 — 1,2 — So3i, J) 


id 


= S27 (s1)Z2 = 51,51 + 82-151, 7). 


ij 
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present contribution relates to his wonderful treatise on differential geometry [18]. 
Advocating a coordinate-free formalism, classical results from (finite-dimensional) 
Riemannian geometry were masterfully extended to infinite-dimensional manifolds 
modeled on a Hilbert space. With few exceptions (e.g., the Hopf—Rinow theorem 
is not any more valid), most classical results are shown to hold in the infinite- 
dimensional setting. It is only natural to wonder whether one can, in the same 
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The motivation for this question is the fact that certain equations of mathematical 
physics (like the inviscid Burgers equation from gas dynamics, the Euler equation 
of hydrodynamics, the Camassa—Holm equation from shallow water theory) were 
shown to be re-expressions of geodesic flow for right-invariant metrics on groups 
of smooth diffeomorphisms, and for such complex systems the need for more 
elegant formulations becomes an issue of paramount importance. The infinite- 
dimensional Lie groups of smooth diffeomorphisms are not Hilbert manifolds but 
Fréchet manifolds (their topology is not induced by an inner product but by a 
countable family of seminorms), situation which raises a number of highly non- 
trivial technical issues since basic analytic results (like the existence and uniqueness 
of local solutions for ordinary differential equations with a smooth right-hand side, 
or like the inverse function theorem) are known not to be valid in general within this 
setting [12]. Nevertheless, rigorous analytical results can be obtained. 

We illustrate the approach by considering the example of the diffeomorphism 
group of the circle. The presented results are to a large extent the fruit of our 
joint research effort over the last five years, through which we benefited from the 
generous encouragement and support of Professor Serge Lang. 


1 Introduction 


In 1966 Arnold [2] showed that the Euler equations of hydrodynamics can be 
obtained as the geodesic equations for the infinite-dimensional Lie group of smooth 
volume- and orientation-preserving diffeomorphisms of the fluid domain D with 
respect to the right-invariant L? inner-product, given on the corresponding Lie 
algebra of divergence-free vector fields tangent to the boundary of D by 


wv) = f wendy. 


Subsequently, other equations from mathematical physics were found to have an 
interpretation as geodesic flows on diffeomorphism groups (see for example [13, 14, 
20,21]). Our aim is to explain the setting presented by the diffeomorphism group of 
the circle. Why study Euler equations on the diffeomorphism group of the circle? 
For two reasons: On the one hand, it is the simplest of the diffeomorphisms groups 
and it is expected that understanding some mechanisms within this setting can give 
insight to deal with more ambitious situations. On the other hand, because it is 
already the configuration space of two famous equations arising in fluid mechanics: 
the inviscid Burgers equation and the Camassa—Holm equation. 

Before proceeding with this concrete example in Section 3, we first discuss some 
fundamental aspects in the general case of an abstract Lie group in Section 2. 
Section 4 is devoted to the study of bi-Hamiltonian structures for Euler’s equations 
on the diffeomorphism group of the circle. A final section deals with the case of the 
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Virasoro group and the KdV equation: a case which has been considered prior to the 
case of the diffeomorphism group of the circle but which, from a didactic point of 
view, has to be studied afterwards because of additional technical difficulties. 


2 Invariant metrics on an abstract Lie group 


2.1 Left-invariant metrics 


A left-invariant metric on a Lie group G is determined by its value at the unit 
element e of the group, that is, by an inner product on the Lie algebra g, expressed 
in terms of a symmetric! linear operator 


A:g—> og", 


called the inertia operator. If t +> g(t) is a geodesic” with g(0) = e, denote by 
g(t) the derivative with respect to ¢, and let 


m(t) =< g(t),->,€ TG. 


2.2. Angular velocities and momenta 


Introducing the left and right angular velocities by 
un = L,-18, UR = R,-18, 
and the left and right angular momenta by 
m, = Lom, mr= Rim, 
the following relations hold between these four geometrical objects*: 
my, = Aur, Ur = Adgur, mr = AdF m_, (2.1) 


where L, R stand for left, respectively right translation. 


A is symmetric if (Au,v) = (Av,u) for all u,v € g*, where the round brackets stand for the 
pairing of elements of the dual spaces g and g*. 

Notice that geodesics issuing from some go € G are obtained via left translation by go from 
geodesics issuing from e. 


The coadjoint action of G on g* is defined by (Adz m,u) = (m, Adg- u). 
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2.3. Noether’s theorem 


In its initial formulation by Emmy Noether in 1918, the theorem states that to each 
infinitesimal transformation which leaves the Lagrangian of a variational problem 
invariant there corresponds a first integral of the motion. In the sixties, Souriau [24] 
extended this result to the more general setting of Symplectic Mechanics. Before 
giving a (weak*) statement of this theorem let us recall the following fact. If a group 
G acts smoothly on a manifold M, to each vector v € g corresponds a vector field 
on M defined by 


d 
Mylz)= 7 _| __ explsy)(a): 


s= 


Theorem 2.1 (Noether theorem). Let (M,w) a symplectic manifold such that 
w = dO (0 € Q'(M)). Let G be a Lie group acting smoothly on M and such 
that g*0 = 6 for all g € G. Then, for each G-invariant function H € C™®(M), 
and each v € g, the function 0(X,) is a first integral of the Hamiltonian vector 
field X x. 


2.4 Euler’s equations 


A Riemannian metric on a Lie group G induces an isomorphism between TG and 
T*G which permits to pullback on TG the canonical 1-form of T*G. We will 
call it 0. The exterior derivative @ = d@ is a symplectic form on TG, and the 
Hamiltonian flow of the function 


1 
A(X) = 5 <X_,X,>, Xe €T,G 
corresponds to the geodesic flow of the metric. If the Riemannian metric is invariant 


by the natural left action of G on itself, so are @ and H. If x. is the vector field on 
TG ~ G x g induced by v € g, we get 


®e.y(Xv) = (Ad# (Au), v), 
where A is the inertia operator defined previously. In particular 


e(t).uz(t)) (Xv) = (me(t), v), 


“The strong version does not require w to be an exact form. It only assumes that w is G-invariant 
and that the symplectic group action of G on M has a moment map. 
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and an application of Noether’s theorem leads us to Euler’s first theorem 


dmpr 
dt 


= 0. (2.2) 


This may be considered as a generalization of the angular momentum conservation 
law of a free rigid body. 
Taking the time derivative of the third relation of (2.1), we then obtain Euler’s 
second theorem 
dm L 
dt 


where ad*m represents the coadjoint action? of g over g*. 


= —adi, mL, (2.3) 


2.5 The contravariant formulation 


It is useful to have a contravariant formulation of (2.3). With this purpose in mind, 
let us introduce the bilinear operator B : g x g — g defined by 


<ad,v,w >=< B(w,u),v >, u,Vv,W Eg. 
Equation (2.3) can then be rewritten as a quadratic differential equation on g, 
uy = B(uy, uz). (2.4) 


This may be considered as a generalization of the evolution equation for the angular 
velocity of a free rigid body with a fixed point. 
Therefore, integration of the geodesic equations may be reduced to two succes- 
sive quadratures 
g = Ls UL , 


2: 
uy = Biuy, uz). Co 


Remark 2.2. The Levi-Civita connection of a left invariant metric is also left- 
invariant. It is thus completely defined by the knowledge of 


1 1 
(View Lev) (e) = zu 2a (Btu, v) + Biv, u)). 


When the metric is bi-invariant, B is skew-symmetric and u;, = 0. In that case, the 
geodesics through the unit element are just the one-parameter subgroups of G. 


It is defined by (ad* m,u) = —(m, ad, uv). 
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2.6 Right-invariant metrics 


In the case of a right-invariant metric, relations (2.1) become 
Mr = AUR, ur = Adg ur, mr = Ady my, (2.6) 
and Euler’s equations become 


dm, dmr * 
. = 0, G = ad) MR. (2.7) 


The contravariant formulation of second Euler’s equation becomes 


ur = —B(ur,ur), (2.8) 


3 Right-invariant metrics on the diffeomorphism group 
of the circle 


In his landmark treatise [18], Serge Lang succeeded in fursning an elegant in-depth 
study of differential geometry on manifolds modeled on an abstract Banach space, 
proving fundamental results corresponding to the classical theorems known in the 
finite-dimensional case. 

In this section, we will deal with the group Diff(S') of smooth diffeomorphisms 
of the circle that are orientation-preserving. This group is naturally equipped with a 
Fréchet manifold structure. More precisely, we can cover Diff(S') with charts taking 
values in the Fréchet vector space® C®(S') and in such a way that the change of 
charts are smooth maps (see [5] for more details). 

Since the composition and the inverse are smooth maps for this structure we say 
that Diff(S!) is a Fréchet-Lie group. Its “Lie algebra” Vect(S') is isomorphic to 
C~(S!) with the Lie bracket given by 


[u,v] = Uxv — uy. 


°A topological vector space E has a canonical uniform structure. When this structure is complete 
and when the topology of E may be given by a countable family of semi-norms, we say that F isa 
Fréchet vector space. In a Fréchet space, such classical results like the Cauchy—Lipschitz theorem 
or the local inverse theorem are no longer valid in general as they are in on Banach manifold. the 
typical example of a Fréchet space is the space of smooth functions on a compact manifold where 
semi-norms are just the C*-norms (k = 0,1,...). 
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3.1 Right-invariant metrics on Diff(S') 


A right-invariant metric on Diff(S!) is defined by an inner product a on the Lie 
algebra of the group Vect(S') = C%(S!). If this inner product is local,’ then 
according to a result of Peetre [22], it is given by 


a(u,v) = / u A(v) dx u,v € C%(S!), 
Sl 


where A is a symmetric linear differential operator. This operator A, that will 
be assumed invertible, is called the inertia operator. The corresponding bilinear 
operator B defined in Section 2 is then given by 


B(w, u) = A! [2A(w)u, + uA(W)x] u,w € C%(S}). 


An important special case is when the inner product is given by the H*-Sobolev 
norm on C%(S!) (k > 0), 


ay (u,v) = i‘ (uv + UyVy feet uy) dx u,v € C%(S!). 


In that case, the inertia operator is 
2 2k 


d k 


3.2 Examples 


For k = 0 (that is for the L*-metric), the corresponding Euler’s equation (2.8) is the 
inviscid Burgers equation 
uy + 3uu, = 0. (3.1) 


For k = 1 (that is for the H'!-metric), the corresponding Euler’s equation (2.8) is 
the Camassa—Holm equation 


Uy — Utxy + 3UUy — 2UyUyy — UUyyy = 0. (3.2) 


7That is, for all u, v € C%°(S!), Supp(u) N Supp(v) = @ => a(u, v) = 0. 
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3.3 Regular dual 


Since the topological dual of the Fréchet space Vect(S!) is isomorphic to the space 
of distributions on the circle, we use a subspace of this topological dual, called the 
regular dual, denoted Vect* (S') and consisting of the linear functionals of the form 


Ube mudx 
gl 


for some function m € C%(S!). The regular Vect*(S!) is naturally isomorphic 


to the space of quadratic differentials m(x)dx? (m € C%*(S')) on the circle, the 
pairing being given by 


(m,u) = / mu dx 
gl 


for u € g = C®(S!) and m € Vect*(S'). 
The coadjoint action of Diff(S!) on Vect* (S!) is given by 


i mog! e ; 1 
Ad, m = ———_,,, meg’ ,gé€ Diff(S’), 


(gr og") 
and the coadjoint action of Vect(S') on Vect*(S!) by 


ad* m = —(2muy + mxu), m € g*,u € Vect(S!). 


3.4 The momentum 
The conservation of the left momentum 
my = Adi mR 
leads [5] to the following conservation law 
(A(u) 0g) - 9: = Ao), 


along each geodesic curve ¢ issued from the unit element /d in the direction uo. 


3.5 The Cauchy problem 


Since there are no general local existence theorem for evolution equations on a 
Fréchet space, the first step in the study of a geodesic flow on the diffeomorphism 
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group is to prove the short-time existence and uniqueness. This was achieved by the 
authors [5] for the right-invariant metric on Diff(S!) generated by the H* Sobolev 
norm fork > 1. 


Theorem 3.1 (Constantin and Kolev 2003). Let k > 1. ForallT > 0, there exists 
a neighborhood of the origin V in Vect(S!) such that for all uo € V, there exists a 
unique geodesic 


y € C™ ([0, T); Diff(S')) 


for the metric H*, starting at p(0) = Id € Diff(S!) in the direction up € 
T;aDiff(S!). Moreover, the solution depends smoothly on the initial data uo € 
c(5"), 

Remark 3.2. For k = 0, which corresponds to the inviscid Burgers equation, one 
can show the short-time existence by the method of characteristics but the proof 
below does not apply. 


Remark 3.3. The spirit of the proof given in [5] is to study the evolution equation 
on each Hilbert space H” obtaining well-posedness on a maximal interval [0, T,,). 
Thereafter one checks that the decreasing sequence 7), does not go to0 asn — +00, 
ensuring thus the short-time existence on C° = meas HI". 

Notice that it is advisable to avoid considering directly the Euler equation 


u, = —B,(u,u) = —Az! [2A (w)ux + wAg(ux)] - (3.3) 


The reason is that A, is a differential operator of degree 2k, and therefore the right- 
hand side of (3.3) is a pseudo-differential operator of degree 1 because of the “bad 
term” uA; (ux). Hence the Cauchy problem for this equation has no meaning in H”. 

The following approach was used in [5] to overcome this difficulty. The operator 


Cy (u) = Ax (uu) — uA (uy) 


is a quadratic differential operator of degree 2k. Therefore, if k > 1, the right-hand 
side of 

Uy + uly = —A; | [2AK(u)ux — Ce(w)] (3.4) 
is a pseudo-differential operator of degree 0. Moreover, u; + uux is just v, 0 g!, 
where v = g = uo gis the Lagrangian velocity. 


Sketch of proof. The proof is divided into three steps. 


Step 1. For eachn > 2k + 1, we have a well-posed Cauchy problem on the Hilbert 
manifold D, x H"(S): 


Pt = V5 
vy, = Ryo Pro Ry-i(v), 
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where Py = —A;! o Ox and Ox (u) = 2Ax(u)ux — Cx (u). In fact, the map 


P(g, v) be (v. Ryo Peo Ry (0)) 


is a well-defined smooth (C”) vector field on the manifold D,, x H”(S) as shown 
in Lemma 3.3. 


Step 2. Let ¢ > 0. By the Cauchy theorem applied on D,, x H” (S), which is an open 
set of a Hilbert space, we know that there exists a positive number 7,,, such that for 
each uo in the ball B,,(0, ¢) in H”(S) there exists a unique solution of the Cauchy 
problem with initial data (0) = Id and v(0) = uo, defined on som e time interval 
[0, T,] with T,, > 0. Each solution of the Cauchy problem P,,+, is itself a solution 
of the Cauchy problem ?,,. What could happen is that the upper bound 7;*, (uo) 
of the maximal time interval of the solution of P,+1 is smaller than 7,* (uo). The 
second step consists in showing that in fact we have 


i. (uo) = Tei (uo) 


foreachn > 2k + 1. 


Step 3. The previous steps permit us to define for each n > 2k + 1 a map 
F, : H"(S) ON Boe41 (0, €) > C(O, Trx41]; Dn) 


which associates to each initial data uo the solution g of the Cauchy problem. This 
map depends smoothly on up as a consequence of the Cauchy theorem since Px is 
smooth. Moreover, we have 


Fy|Ht! = Fuvi 


from which we deduce that the inductive limit 
F :C™(S) A Bo 41(0, €) > C™([0, Tox 41]; D) 


is also a smooth map. oO 


Lemma 3.3. For each k > 1 and eachn > 2k + 1, the operator 


P(g, v) = (9. Ry ° Pk o R,(v)) 


is a smooth map from D, x H"(S) to H"(S) x H"(S). 


Remark 3.4. We cannot conclude directly from the smoothness of P, that Py 1S 
smooth because neither the composition nor the inversion are smooth maps on 
H" (S!). 
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Proof. To prove the smoothness of Py, we write it as the composition Py = A, 
Ox, where 


Ak(g.v) = (¢, Rg 0 Ago R= ()) 
and 
Ox(¢,¥) = (o. Ryo Qk Ry (0)). 


Note first that 


k 
Rg 0 Ax o Ry-i(v) = Yo? (vo gp )??) og 
p=0 


is a polynomial expression in the variables 


1 - 
— Pxrxreres gp?) 


(yk), 
Px 


Vi Vere is 
For example for k = 1, we get 


= Pxx 1 
Ryo Ayo Rg(vy) =v + arc a 


and to prove the general case, we let ap = (vo gy ')) oy, and use the recurrence 
relation 


! 
a =-—d@ 
p+l Dp: 
x 


A similar reasoning for Ry 0 Ox © Rg-1(v), where 


k 2p 


O;(u) = 2Ax(u)uy — YE? x Ci uOuer ty, 


p=0 i=l 


shows that it is also a polynomial expression in the variables 


To conclude that Ak and Ox are smooth maps from D, x H”(S) to D, x 
Sa (S), we use the following known facts (see for example [1]): 


(1) Forn > 1, H"(S) is a Banach algebra and hence polynomial maps on H”(S) 
are smooth. 
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(2) Forn > 1, the map H"(S) > H"~!(S), v& vx is smooth. 
(3) Forn > 1, the map H"(S) 9 {v > 0} > H”(S), v & 1/v is smooth. 


To show that AS! >D,y X Hn (S) > D, x H"(S) is smooth, we compute the 
derivative of A, at an arbitrary point (¢, v), obtaining 


~ Id 0 
DAMON) = ( * Ryo Ax aah 


It is clearly a bounded linear operator in view of the preceding analysis. Moreover, 
it is an invertible operator since A, is itself invertible. The application of the local 
inversion theorem in Banach spaces achieves the proof. Oo 


3.6 The exponential map 


In classical Riemannian geometry, the exponential chart and normal coordinates 
play a very special role. This is a key tool in the study of geodesics. 

On Diff(S!) the existence of this privileged chart is not ensured automatically. 
One may find it useful to recall on this occasion that the group exponential of 
Diff(S!) is not a local diffeomorphism.® We have a similar negative result for the 
Riemannian exponential map of the L? metric but a positive result for the H* metric 
ifk >1. 


Theorem 3.6 (Constantin and Kolev 2002). The Riemannian exponential map 
erp for the L?-metric on Diff(S') is not a local C '-diffeomorphism near the origin. 


Sketch of proof. Assuming erp to be a C! local diffeomorphism from a neighbor- 
hood of zero to a neighborhood of the identity map of Diff(S'), one can show 
(see [4]) that the derivative Derp(0) of erp at zero is the identity map, while 


(Dew w(x) = 5- [wordy 
for v(x) = c (constant). But this yields the contradiction 
Detp,, Wn = 0 
for 


Wn = sin(anx) €C°(S!) and vy, = -, n> 1, 
n 


8Indeed, this map is not locally surjective. Otherwise, every diffeomorphism sufficiently near to 
the identity (for the C° topology) would have a square root. However one can build (see [19]) 
diffeomorphisms arbitrary near to the identity which have exactly | periodic orbit of period 27. But 
the number of periodic orbits of even periods of the square of a diffeomorphism is always even. 
Therefore, such a diffeomorphism cannot have a square root. 
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since v, > 0 in C%(S!) asn — ov, while Derp(v) is supposed to be invertible 
in a neighborhood of 0 € C®(S!). Oo 


Theorem 3.7 (Constantin and Kolev 2003). Fork > 1, the Riemannian exponen- 
tial map exp for the H* -metric on Diff(S!), is a smooth local diffeomorphism near 
the origin. 


Sketch of proof: The approach relies on two important consequences of the con- 
servation law obtained in Section 3.4. [5], namely that whenever n > 2k + 1 we 
have: 


(i) Firstly, in the scale provided by the Sobolev spaces H”, the geodesic ¢,, (t) 
issuing from the identity in the direction of uo inherits at each instant t > 0 
exactly the regularity of uo (that is, if uw» ¢ H"*!, then g,,(t) ¢ H"*t! for 
t > 0); 

(ii) Secondly, for uw) € C%(S!) there is no function w € H” \ H"*! such that 
Derp,(w) € H"*!, 


Taking these two facts for granted, we proceed as follows. Since Derp is 
the identity map, the regularity properties of erp established in Section 3.5 in 
conjunction with the inverse function theorem in Hilbert spaces ensure the existence 
of two open neighborhoods V2.4; and O,4, of 0 € H?**!, respectively of the 
identity in H7**!, such that erp : Vor41 —> O2x-+1 is a smooth diffeomorphism with 
Derp,, : H+! + H***! bijective for every uo € Vox+1. We now claim that erp 
is a smooth diffeomorphism from V = Vr.4, NC*(S!) to O = O41, NC~(S!). 
Indeed, let up € V. The regularity properties of erp ensure that Derp,,, is a bounded 
linear operator from H” to H” for every n > 2k + 1. We now prove inductively 
that it is a bijection. For n = 2k + | this is so by our choice of V2.4; and 02,41. If 
itis true for 2k +1 < j <n, then Derp,, is injective as a bounded linear map from 
H"*! to H"*! since its extension to H” is injective. The second fact emphasized 
above ensures its surjectivity. Using now the inverse function theorem on Hilbert 
spaces, both erp and its inverse are smooth maps from V 1 H” toV M A”. Letting 
n — oo we obtain that erp is a smooth diffeomorphism near 0 € C%(S'). Oo 


4 Integrability of the Euler equations 


4.1 Lie-Poisson structure 


On the dual g* of a Lie algebra, there is a canonical Poisson structure°, defined by 


{fi ghrp (m) = m([din f. dmg), feecro"), 


°Tf g is the Lie algebra of a Lie group G, this structure corresponds to the reduction of the canonical 
symplectic structure on T*G by the left action of G on T*G. 
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and called the Lie-Poisson structure (see [25]). Each Euler equation on g* is 
Hamiltonian relatively to this structure. 

In the case of the Lie algebra Vect(S') of vector fields on the circle, this bracket 
is represented by the family of operators 


J(m) = —(mD + Dm), 


where D = d/dx. For an inertia operator A : Vect(S'!) — Vect*(S'), the 
Hamiltonian is given by 


1 
H4(m) = =f mA'(m), meg", 
2 Js 
the corresponding Hamiltonian vector field being 


X4(m) = —(mD + Dm)(A™'m). 


4.2 Bi-Hamiltonian structure 


One remarkable property shared by Burgers and Camassa—Holm equation is the 
existence of an infinite number of conservation laws for these equations. For 
example, the following functionals are commuting first integrals for the Burgers 
equation 


Hem) = f m* dx, (k = 1,2,...). 
sl 


The mechanism which is at the origin of the existence of these conservation 
laws is known as the Lenard scheme or bi-Hamiltonian formalism (see [23] for 
an excellent historical survey). It is characteristic of evolution equations in infinite 
dimension known as formally integrable, in analogy with classical integrable 
systems (in the sense of Liouville) in finite dimension. 

Two Poisson brackets {, }p and {, }g on the same manifold M, defined by 
Poisson bi-vectors P and Q are said compatible, if every linear combination of 
these brackets 


{ho tau =MAgte+ uh gto, AMER, 


is itself a Poisson bracket; that is, it is anti-symmetric, bilinear and it satisfies the 
Jacobi identity. 

When a vector field is Hamiltonian relatively to two compatible Poisson brackets, 
we Say that this vector field is bi-Hamiltonian. In the good cases, this leads to the 
existence of a hierarchy (H,,)nen of commuting first integrals (relatively to both 
structures). These functions are defined recursively by the so-called Lenard scheme: 


P dH, = O dAn4, a 
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4.3 Affine Poisson structures on g*. 


On the dual of a Lie algebra g*, one obtains a Poisson bracket by choosing a skew- 
symmetric bilinear functional y on g and letting 


{fg })(m) =vdnfidmg), fe ecg"). 


This bracket is compatible with the Lie—Poisson bracket if and only if y is a 2- 
cocycle,'® that is: 


y([u.v]l.w) + y(v. w).u) + y([w, uw], v) = 0, 


for all u,v,w € g. 
For instance, it happens that the Burgers equation is Hamiltonian relatively to the 
following cocycle of Vect(S') 


yur) = | uDvdx 
gl 


and the Camassa—Holm equation relatively to 


y(u,v) = [ u(D — D?)vdx. 


In fact, these examples are essentially unique (see [6], [16]). 


Theorem 4.1 (Constantin and Kolev, 2005). The only continuous, linear, invert- 
ible differential operators A : Vect(S'!) — Vect*(S!') with constant coefficients, 
whose corresponding Euler vector field X 4 is bi-Hamiltonian relatively to some 
modified Lie—Poisson structure are 


A=al+bD’, 


where a,b € R satisfy a—bn? # 0 foralln € Z. The second Hamiltonian structure 
is induced by the operator 


Q =-DA=-aD -bD’, 
where D = d/dx and the Hamiltonian function is 


A3(m) = sf (au? — bu(ux)”) dx, 


where m = Au. 


'0A special case occurs when this cocycle y is a coboundary i.e. y(u,v) = mo({u, v]) for some 
mg € g* (freezing structure). 
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5 The Virasoro group and Korteweg-de Vries equation 


Historically, the bi-Hamiltonian formalism has been introduced!! at the end of the 
1970s for the Korteweg-de Vries equation 


uy + 3uuy — CUyxxx = 0, ceER. 


Notice that the expression 3uu, — CU x x 1S not quadratic in u and therefore cannot 
be written as an Euler equation on Vect(S!). However, it has been shown in [14] 
that it can be written as an Euler equation for the L* metric on the Virasoro group,'* 
which is a central extension of Diff(S!) by R. This equation was already known in 
the seventies to be bi-Hamiltonian relatively to the two brackets on C®(S!) defined 
by the operators D and —(Dm + mD) + cD?. 
The Lie—Poisson bracket on the regular dual Vir* = C®(S') @R of the Virasoro 
algebra is represented by matrix 
J(m,a) = eu — ae: ) ; 


The functions F(m,qa) on Vir* which depend only on @ are therefore Casimir 
functions for the canonical structure on Vir*. In particular, the Hamiltonian flow 
for the L” right-invariant metric leaves invariant each hyperplane a = c (constant). 
The canonical structure induces on the hyperplane a = c, which is isomorphic to 
C®(S!), a Poisson structure represented by the operator —(Dm + mD) + cD?, 
which gives a geometric explanation for this operator. Notice that forc = 0, we 
recover the canonical Poisson structure on Vect(S!) and the Burgers equation. 
A similar approach can be pursued for the general Camassa—Holm equation 


Uy — Upyy + 3UUy — 2UyUyx — UUyxx + CUyyy = O, ceER. 


This equation can be obtained as the Euler equation for the H! right-invariant metric 
on the Virasoro group. 


By Gel’fand, Dorfman, Magri. See the review [23]. 
The composition in the Virasoro group Vir = Diff(S!) x R is given by 


(4.0) 0.) = (goy.at B+ BO. W)) 


where ‘ 
1 
BOW) = —5 | tosC(H(x)))x d log v(x) 


is the Bott cocycle. 
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The theorems which were stated for the diffeomorphism group Diff(S!) in 


Section 3 on the short-time existence of the geodesic flow and on the Riemannian 
exponential map for H* metrics are still true on the Virasoro group but the chart 
property of the Riemannian exponential map is true only for k > 2 in this case 


(see [8]). 
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cohomology classes 


J6zef Dodziuk, Jeffrey McGowan, and Peter Perry 


Dedicated to the memory of Serge Lang 


Abstract We give a construction of harmonic differentials that uniquely represent 
cohomology classes of a non-compact Riemann surface of finite topology. We 
construct these differentials by cutting off all cusps along horocycles and solving 
a suitable boundary value problem on the truncated surface. We then pass to the 
limit as the horocycle in each cusp recedes to infinity. 
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1 Introduction 


This paper contains a construction of unique harmonic representatives of de Rham 
cohomology classes of a noncompact Riemann surface X of finite geometry with 
precise estimates on the behavior of these forms at infinity. Our interest in this 
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question was motivated by an approach, used by Katsuda—Sunada [3] and Phillips— 
Sarnak [5]), to the problem of counting of closed geodesics in a fixed cohomology 
class. The approach is based on the Selberg trace formula applied to sections of 
the flat line bundle E, over X induced by the character x of the fundamental 
group. The characters are parametrized by the elements of the Jacobian torus 
H'(X,R)/H'(X, Z) and the cohomology classes in turn are parametrized, at least 
for compact surfaces, by the harmonic forms. In this approach, cf.[3] the harmonic 
forms are also used to “untwist” the Laplacian A , acting on sections of the bundles 
E,, replacing it by the unitarily equivalent operator Ly acting on functions on 
X or equivalently on automorphic functions on the upper half-plane. If y = 7. 
corresponds to the harmonic form w, then the operator 


Lyf = Af —4ni(df,o) + 42’ loll’ f, 


where the inner product and the norm are induced by the Poincaré metric of the 
upper half-plane. Thus the twisted Laplacians appear as a family of operators acting 
on a fixed Hilbert space and one can apply perturbation theory to this family. 
This works very well for compact hyperbolic manifolds since Hodge theory gives 
good control of harmonic forms. The program was successfully carried out by 
McGowan-Perry [4] for infinite volume hyperbolic manifolds without cusps and 
of arbitrary dimension. Their proof relied on an unpublished result of Mazzeo 
about representing cohomology classes in H'(X,R) for such manifolds (or more 
generally for conformally compact manifolds) by bounded harmonic forms. 

We hope to apply the results of this paper to the case of Riemann surfaces that 
have both funnels and cusps. We remark that the case of hyperbolic manifolds of 
finite volume was handled by Epstein [2] by a different method. 


2 Statement of the result and an outline of the method 


Let X = I'\H? be a geometrically finite Riemann surface with n cusps and k 
funnels. We will describe a construction of harmonic 1-forms on X representing 
cohomology classes. Every cohomology class is determined by its periods, i.e., its 
values on cycles. We are going to describe and construct harmonic forms with nice 
properties representing cohomology classes. We only consider the case where X 
is noncompact, i.e., 2 + k > 0 since the compact case is fully covered by Hodge 


theory. Let y!,...,y* be cycles at bases of cusps C!,...,C*. Fix a cohomology 
class c € H'(X,R) and let p; = (c,y'). 
We have 


Theorem 2.1. For every class c as above there exists a unique form @ with the 
following properties. 


1. @ is harmonic, i.e, dw = 0 andd *w = 0. 
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N 


Periods of w are determined by c, i.e., [,@ = (c,z) for every cycle z in X. 

3. Let X' be X with arbitrarily small neighborhoods of those punctures for which 
the period pj; is not equal to zero removed. The form w is in L? on X'. 

4, On every cusp C' choose coordinates (r,t) so that the metric on the cusp takes 
the form ds* = dr? +e~*' dt? and y; is determined by r = 0. The form w— p;dt 
isin L? onC'. 

5. w has a smooth extension to the surface X obtained by adding circles at infinity 

to all funnels of X and has the normal component equal to zero along these 

circles. 


Moreover, for every cusp C', there exists a constant A; > 0 such that 


lw _ pidt| (1) < Asere BE), 
Our construction of w will proceed as follows. We parametrize all cusps so that 
they are isometric with 


C = [-In2,o) x T! with the metric dr? + e7?" dt’, (1) 


where T! = R/Z. According to [1, Theorem 4.4.6], the cusps are disjoint. For 
every positive a we denote by X, the original surface with all cusps cut off at 
r = a. Consider Xp for large R. Xp is a hyperbolic surface with boundary all 
of whose components have length e~®. The surface Xp has a natural conformal 
compactification Xz whose boundary consists of horocycles cutting off the cusps 
and the circles at infinity added to funnels. The cohomology of Xz is naturally 
isomorphic to the cohomology of X and we use the de Rham-Hodge theory for 
manifolds with boundary [7, Proposition 4.2, Corollary 5.7] to construct a harmonic 
form wr on Xp that represents c and satisfies absolute boundary conditions 
(@r), = 0 on the boundary. Since * operator is a conformal invariant, wr|Xp 
is an L? harmonic form on Xp with periods prescribed by c. We now track wr as 
R — o and Xp develops into X . We show that for a subsequence of R, the limiting 
form @ exists and has the required properties. The convergence will be uniform on 
X, for every fixed a > 0 and the L? norms of wp will blow up when R > oo only 
in those truncated cusps for which the corresponding period is not equal to 0. 


3 Preliminaries 


In this section we set up notation, review conformal invariance of certain objects 
and justify several statements made in the introduction. Recall that for a positive 
function A on a Riemannian manifold (X, ds), the metric A7ds* is said to be 
conformally equivalent to ds”. If X is a surface, then the Hodge * operator acting on 
differential forms of degree one depends only on the conformal class of the metric. 
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The * operator is used to define the L? norm and harmonicity of forms. Namely, a 
differential (a form of degree one) w is harmonic if and only if 


dw=d*w=0 (2) 


and the inner product of two differentials is given by 
on=f oA -*N. (3) 
x 


It follows that the harmonicity and the L* inner product together with the 
associated norm are conformally invariant. Moreover, the normal direction at the 
boundary is clearly a conformal invariant. Thererefore, vanishing of the normal 
component of a differential at a boundary point is a conformally invariant condition 
as well. According to the definition (2) above a harmonic function is locally 
constant. This is too restrictive and we call a function harmonic if dh is a harmonic 
differential, i.e., if d * dh = 0 which is equivalent to the usual definition of a 
harmonic function. 

Every Riemannian surface (X, ds”) admits isothermal coordinates. More pre- 
cisely, every point of X has a coordinate neighborhood U and two function x 
and y defined on U such that (x, y) are local coordinates on U and the metric 
ds” is conformal to the Euclidean metric dx? + dy*. Now a restriction of a 
harmonic differential @ to U can be expressed as a(x, y)dx + b(x, y)dy and 
*w@|y = *(adx + bdy) = ady — bdx. The equations (2) translate to 


by =day and a, =—b, 


which are Cauchy-Riemann equations for the function f = a —ib. Thus, locally, 
we get a one-to-one correspondence between harmonic differentials adx + bdy and 
holomorphic functions f = a —ib of z = x + iy. Under this correspondence 


Ilo? = [ onso= [+e dvdy = [ sPaxay, (4) 


As aconsequence, if U is a punctured disk, we see that L? harmonic differentials on 
a punctured disk have a removable singularity. We also note that w can be recovered 
from f as follows: 


w = Re(f(z)dz). (5) 


For a cusp C and an interval J, we will write C; = JI x T', and to 
simplify the notation we set Cr = Cir.oo). We now describe a convenient conformal 
parametrization of cusps. In particular, we will be interested in Cjo,o9) with the 
metric as in (1). Of course Cjo,p] is conformally equivalent to an annulus in the 
Euclidean plane and Cjp,o9) is equivalent to a punctured disk. We will make this 
equivalence explicit. If (0, @) are polar coordinates in the plane, we set 0 = 2xt 
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and p = p(r) with p(0) = 1. We would like to map the cusp so that the base 
r = 0 is mapped onto the unit circle and the horocycle r = R goes onto a small 
circle around the origin. Thus, we need p to be a decreasing function of 7. An easy 
calculation shows that the metric (1) written in terms of p and 6 is given by 


dr\* ap\ et 
ds? =| —) (do? +(— de? |. 
(<;) ( e+(Z) 
The Euclidean metric is expressed in terms of polar coordinates as dp” + p*d6@. It 
follows that the two metrics are conformal if and only if 


Taking into account that p(0) = 1 and ae < 0 we solve this equation to obtain 
p= ete 1) (6) 


Finally, we observe that the expanding ends of X, so called funnels, are compact 
from the conformal point of view. Namely, let F = (—oo, oo) x T! with the 
hyperbolic Riemannian metric 


ds? = dr? + @ cosh’ r dt?. (7) 


Here £ is the length of the geodesic at the “waist” of the funnel. A calculation similar 
to the one for the cusp shows that F' is conformally equivalent to an annulus in the 
plane, and thus can be compactified by adding boundary circles. The conformal 
equivalence is given by the equations 


6 = 2nt, p= e2/9 So ak (8) 


Since the limits of p(7) at +00 are finite these equations define the mapping of F 
onto an annulus that can be compactified by adding boundary circles. We can use 
this procedure to compactify all funnels of X by adding a circle at infinity to each 
to obtain the surface with boundary X. 

We make a comment about the behavior of L* harmonic differentials near the 
boundary points of X. Since every circle in the z = x + iy plane can be mapped 
by a Mobius transformation onto the real axis, every boundary point of X has a 
neighborhood that is conformally equivalent to the intersection of a disk D centered 
at the origin intersected with the closed lower half-plane. Writing a differential w 
as w = adx + bdy we see that the requirement that the normal component of w 
vanishes on the boundary is equivalent to b(x, 0) = 0. It follows that the associated 
holomorphic function f = a—ib is real on the real axis so that Schwartz’s reflection 
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principle applies. Therefore both f(z) and w extend to D and the L? norms of either 
one on the upper and lower half-disks are equal. In addition, @ = dh in D fora 
harmonic function h determined uniquely up to an additive constant and satisfying 
h(x, y) = h(x,—y). In our conformal identification of a funnel with an annulus 
we mapped the added circle at infinity into the outer boundary of the annulus in the 
complex plane. The remark above implies that both w and f extend analytically 
with bounded L? norms to a larger annulus containing the image of the funnel in its 
interior. Moreover, if @ is exact on the whole annulus, then its primitive 1 extends 
as well and has equal values at points symmetric with respect to the boundary circle. 
Finally we recall the following elementary inequality, 


1 
| f(0)| < RJq | lrwe) (9) 


that holds for every holomorphic function f on a closed disk Dr centered at the 
origin in the complex plane. 


4 Proof of the theorem 


We first prove the existence of @. ||n||4 will denote the L* norm of a differential n 
on the set A C X. Now, for every a > 0 consider the harmonic differential w, on 


Xj, as in the introduction. 


Lemma 4.1. For every a > 0, there exists a constant m(a) > 0 such for all R = 
a+2, 


lor — Mati lly, < 1(4). 


Let C be one of the cusps C;, 1 <i <k, p = pj. For every R > 2 
lor — pat|leo x sm, 


where m2 > 0 is a constant independent of R. 


Proof. We prove the second estimate first since the proof in this case is somewhat 
simpler but contains all the essential aspects. The harmonic form wr — pdt is exact 
on Ci-in2,R] Since it is closed and its only period is equal to zero. Moreover, its 
normal component vanishes on the outer boundary yr of Cjo,r}. We use the notation 
y, = {r} x T!. If wr — pdt = dg, thend * g = 0 on Ca. Therefore 


Jor—pdtlyy =f ede 
0 


by Stokes’ formula. Observe that the function g is determined only up to an additive 
constant. Thus after adding a suitable constant we can assume that infc._,,,,, 8 = 1. 


Harmonic representatives for cuspidal cohomology classes 167 


It follows from the Harnack inequality [6, Theorem 21] that |g| is bounded on 
C-(1/2) n2,1/2] by a constant independent of R. Now for a harmonic function, bounds 
on the function imply bounds on derivatives [6, Section 13]. Therefore |dg]| is 
bounded on yo as well. This proves the second inequality in the lemma. 

Consider the form wr — @g+; on Xq41. This form is harmonic and has normal 
component equal to zero along all circles added to compactify the funnels. By the 
discussion at the end of Section 3 this form has an extension to an L? harmonic form 
¢ defined on the surface Y obtained by adding annuli to the funnels. The homology 
of Y is carried by X,+; and therefore all periods of ¢ are equal to zero. It follows 
that @ is exact, i.e., 6 = dh for a harmonic function h on Y. Since ¢ = Wr — Wa4+1 
on Xq+1, the normal component of dh vanishes on circles that bound funnels. Using 
Stokes’ formula and the fact that dh \ «dh = d(h x dh) we see as above that 


Jon —onsil, = [dh axdh = > hx dh, (10) 
Xq 


a 
iT; 


where y/, is the cycle r = a in C’. We now need to estimate h and dh on y%. 
The function / is determined only up to an additive constant. Therefore we can 
assume that min h|y5; = 1. As we saw above, / extends to a harmonic function 
h, on Y that satisfies the same lower bound as h, i.e., hy > 1. By the Harnack 
inequality there exists a constant A > 0 that depends only on the geometry so that 
h(p) < A for every p € Xq+41/2. Now, for a harmonic function, the bounds on the 
function on a disk around a point can be translated into bounds on partial derivatives 
at the center. It follows that we can bound dh uniformly on I! by a constant that 
depends only on the original surface X and on a. This finishes the proof. oO 


It follows from the first inequality in the lemma that for a fixed a the norms 
|@r||x, are bounded independently of R. Locally L? harmonic forms can be 
identified with L* holomorphic functions. A uniform bound on L? norms implies 
via (9) that the family wr is a “normal family.” All the estimates apply via the 
reflection principle up to the boundary of X,. A standard diagonal argument shows 
there exists a sequence Rk; — oo such that wr, converges uniformly on X, for 
every fixed a > 1 to a harmonic form w. Obviously, w is harmonic, has prescribed 
periods, and its normal component on circles at infinity vanishes. 

It remains to investigate the behavior of w in cusps. On a given cusp C = C’, 
consider the form 7, which is equal to wr; — pdt extended by zero to Cyr; oo). 
By Fatou’s lemma, using the second inequality in Lemma 4.1, its pointwise limit 
w — pdt is in L* on Cpo.o0). This finishes the existence part of the proof. 

The uniqueness is easy. If w; and w2 are two forms satisfying our conditions, 
then 7 = @ — @» is an L* harmonic differential on X with vanishing periods. 
Singularities of 7 in cusps are removable. Consider the Riemann surface Z obtained 
from X by closing the punctures and adding circles at infinity to funnels. The form 
n extends to Z, is harmonic, has zero periods and satisfies the absolute boundary 
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conditions 7, = 0 on the boundary of Z. Therefore, 7 = dh on Z, h is a harmonic 
function and 


Inlz, =f dhasdn= [han =o 
1 


Xx 


and @; —@. =n = 0. 

Finally, we prove the decay estimate of w — p;dt in cusps. We identify C' with 
a punctured disk with coordinate zg = pe!” and use the correspondence between L” 
harmonic differentials and L* harmonic forms w— pdt = Re( f(z)dz). The function 
F(z) must have a removable singularity at zero and is therefore bounded. Thus it is 
enough to estimate the pointwise norm of dz = d(pe'®) = e!°dp + pie'’dé in 
terms of the hyperbolic metric ds? = dr? + e’ dt?. A calculation using (1) and (6) 
yields 


|dp| = 2me"e 2") and = |d0| = 2n|dt| = 27". 


Thus |dz| < |dp| + 2ze’ p which in view of (6) implies our estimate. 
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About the ABC Conjecture and an alternative 


Machiel van Frankenhuijsen 


In memory of Serge Lang 


Abstract After a detailed discussion of the ABC Conjecture, we discuss three al- 
ternative conjectures proposed by Baker in 2004. The third alternative is particularly 
interesting, because there may be a way to prove it using the methods of linear forms 
in logarithms. 
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1 Introduction 


I met Serge first around 1990 in Utrecht, when he gave a talk there and I was a 
student in Nijmegen. I asked him a question during the break, and at some point 
during the second half, he suddenly pointed a finger at me, asking “let’s see what 
they teach in Nijmegen: when you have a meromorphic function with simple poles 
and integer residues, what do you do?” After the talk I was surprised and happy 
when he insisted that I join the group to a restaurant. On the way back in the train, we 
talked about mathematics, and about a week or two later I was even more surprised 
when a small box full of books arrived. Since then, we have been in contact regularly 
until Serge passed away. 
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During my thesis research, I asked Serge by email if I could visit him at Yale. 
He immediately called back by phone, saying “No, it’s impossible, I cannot find 
a place for you to stay.” Half an hour later he called again: he had found a place 
for me. 

I fondly remember the lunches while at Yale. One day, I explained to him 
the theorem of Stewart and Tijdeman that the error term in the ABC Conjecture 
is of order h!/2, and my idea that this should be related to the fact that the 
zeros of the Riemann zeta function have real part 1/2. He looked at me for 
a moment and then said: “This is insight! Now you have to work until you 
prove it.” 

These and other experiences with Serge have been truly inspiring for me. I 
consider Serge to be one of the greatest teachers that I have had. He is the one 
who taught me how to do research (“Formulate the theorem and prove it!”), and 
how to write it up (when I asked him one time how he wrote so many books, 
he looked at me with a puzzled expression as if to say “just start and never 
stop!’”). 

Serge had a great insight into the interconnection of questions of geometry and 
Diophantine analysis, thus contributing to the field of Diophantine geometry. One 
of his latest insights was to emphasize the difference between the error terms in 
the Vojta Height Inequality and the radicalized version of this inequality. This is 
well explained in the manuscript Questions about the error term of Diophantine 
inequalities [La05], which, I think, has been left unfinished. We were in contact 
about this shortly before Serge died. The paper [vF06] was initiated by his question 
whether the implication of the title is true (Serge’s intuition shines through the fact 
that when he contacted me by phone about this question, I was not sure immediately 
if this could be proved, but soon I realized how the argument of [vF04] could be 
modified). Other papers were also inspired by Serge’s generous sharing of ideas and 
enormous drive. 

In this paper, we explore another of Serge’s questions, asked around the same 
time, about an alternative to the ABC Conjecture proposed by Baker. We first 
explain the ABC Conjecture, and illustrate it with several diagrams. Then we explain 
and discuss Baker’s alternative. 

I sent an early draft of this paper to Serge shortly before he passed away. The 
draft contained some graphs and a very incomplete text, which quite upset Serge to 
my regret. The present paper explains much clearer the available data. The reader 
will see that no firm conclusion can be arrived at. More data, obtained by a larger 
exhaustive search of abc sums, is needed for a more definite conclusion. Such data 
is being assembled in the project Reken mee met ABC [LPSO9]. 


Acknowledgement We thank the referee for suggesting a number of important improvements to 
this paper. David Masser and Joseph Oesterlé kindly shared their recollection of the origins of the 
ABC Conjecture, and Hendrik Lenstra improved the paper by asking some insightful questions. 
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2 The ABC Conjecture 


In 1985, Masser attended a talk by Oesterlé that involved elliptic curves (and 
possibly Szpiro’s conjecture, which is closely related, see (5) below). It reminded 
him of another recent development, a theorem for polynomials by Mason [Ma84] 
(see also [Si84, Sto81]), and he formulated the ABC Conjecture in July 1985 at a 
London conference in honour of Roth’s sixtieth birthday. Only the “Open Problems” 
were published in some form,! but in 1986, as a possible approach to Fermat’s Last 
Theorem, Oesterlé formulated the ABC Conjecture in the Bourbaki seminar [O88]. 


The height of an abc sum P:a + b = c of coprime integers is 
h(P) = max{log |a|, log |b|, log |c|}, (1) 
and the (logarithmic) radical of P is defined by 


r(P) = > log p. (2) 


plabc 


With these definitions, the ABC Conjecture can be formulated as follows. 


Conjecture 1. There exists a function y such that limp+oo w(h)/h = 0 and 
h(P)—r(P) < wA(P)) (3) 


for all abc sums P. 


Smirnov explains in [Sm93] that the ABC Conjecture can be interpreted as 
a Riemann—Hurwitz inequality for rational numbers, interpreted as functions on 
spec Z (see also the appendix of [Mas02]). 

In [StTi86], using an extension of Baker’s theory of bounds for logarithmic 
forms, Stewart and Tijdeman obtain that the height of every abc sum is bounded 
by a power of its exponential radical: 


h(P) « eb"), (4) 


Apart from subsequent improvements of the exponent 15 to eventually 1/3 + ¢ 
[StYO1], this is still the best known result. With this result, one sees that if the radical 
is fixed, i.e., if a, b and c are to be formed using only prime factors from a fixed set 
of prime numbers, then a, b and c are bounded, and hence there exist only finitely 


'See [GdS07], which is also of interest to non-Dutch readers for a photographic reproduction of 
the relevant page. 
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many such abc sums.” Equivalently, in every infinite sequence of abc sums, r(P) is 
unbounded. However, (4) is too weak to imply Fermat’s Last Theorem. 

It is easy to deduce Fermat’s Last Theorem from the ABC Conjecture, provided 
the function y is explicitly known.’ We will give the argument here, to make the 
point that, roughly speaking, the ABC Conjecture says that the exponents of the 
prime factors of a, b and c are at most 3 + € on average. Suppose that x” + y” = 2” 
and n > 4. Then we have an abc sum P of height n log z and radical 


r(P) = > log p < log xyz < 3 logz. 


p\xyz 


By the ABC Conjecture, (n —3) logz < w(n log z). Since y(t) = o(h), there exists 
an ho such that y(h) < h/4 for h > ho. Then, for z > e”°/4 we have n logz > ho, 
and we conclude that n—3 < n/4,andhencen < 4. There remain only finitely many 
values for z to check, and for each of these values, we need only check exponents 
n <ho/ logz. 

We see that the ABC Conjecture does not allow us to prove Fermat’s Last 
Theorem for exponent 3. However, we can get arbitrarily close to exponent 3. 
Indeed, letting z — oo in the above argument, we see that n — 3 < o(1). Szpiro 
conjectured that 


log(abc) _ 
“(Py _ 


lim sup 
where the limsup is over all abc sums P. 
Defining the quality of an abc sum P as g(P) = h(P)/r(P), one could state 
the ABC Conjecture alternatively as 


lim sup qg(P) = 1. 


Also in the paper [StTi86], Stewart and Tijdeman prove that there exist infinitely 
many abc sums such that the height is larger than the radical. Their result was 
subsequently improved by the author [vFO0] to 


JP) 


for infinitely many abe sums P: h(P)—r(P) > 6.07 : 
logh(P) 


(6) 


7In other words, (4) implies the classical result of Siegel and Mahler on the S-unit equation. The 
innovation of Stewart and Tijdeman was to use Baker’s theorem on linear forms in logarithms, 
generalized to p-adic logarithms, to make this result effective. 


3If yw is not explicitly known, one would deduce that there could only be finitely many 


counterexamples to Fermat’s Last Theorem, but one would not know when to stop looking for 
one. 
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It follows that there exist infinitely many abc sums with g(P) > 1. We computed 
the quality from two tables. The first table, by Benne de Weger, lists all abc sums 
with g(P) > 1.2 up toc = 2” (ie., up to height 22.18; our diagrams reflect 
this by a high density of data points up to this height), and the second table lists 
abc sums with g(P) > 1.28 up to a height of 50, compiled by the author from 
Abderrahmane Nitaj’s tables [N09]. The first table is obtained by an exhaustive 
search, but the second table is not exhaustive.* These data indicate that the quality 
might be bounded by 1.6299, the quality of the abc sum 2+3!°x 109 = 23° of height 
15.6775. However, at present, even a proof of a statement such as g(P) < 1,000 for 
every P would be a great theorem. 

The disadvantage of the quality is that for any bound g > 1, one expects only 
finitely many abc sums of quality larger than gq. Thus up to height 22.18, and 
as Figure | suggests, probably well beyond this height, one finds many sums for 
which q(P) is about 1.2. One would like to discard those and only record the 
more interesting sums with g(P) > 1.4. But for larger heights, probably when 
h(P) > 100, there may not be any point with g(P) > 1.4, and already examples 
with q(P) > 1.2 are interesting. Thus one would like to have a criterion that adapts 
with the height and would tell us which abe sums are interesting and which ones 
should be regarded as “common” or “too abundant’, a criterion that becomes less 
restrictive as the height increases. 

By (6), if a function y as in Conjecture 1 exists, then y(h) > 6.07Vh/logh. 
This provides us with the kind of adaptive criterion as was mentioned at the end of 
the previous paragraph° (see the curved line in Figure 1). Thus, we regard an abe 
sum as interesting if h—r > 6.07/h/ log h. In terms of the quality, this means that 
interesting abc sums have a quality of at least 


1 
6.07 
(2 |=. _.. |) + 
Vh(P) logh(P) 
Thus up to height 22.18, interesting abc sums have a quality of at least 1.72 (hence 


in this sense, there are no interesting abc sums of height less than 22.18). The first 
interesting abc sum is 


5° 4.29 x 3!" x 13? = 11° x 17 x 31° x 137, 


with a height of 30.0446, and for heights between 30 and 50, a quality of 1.28 
already makes an abc sum interesting. 


4We have omitted from our table all abc sums with h > 50, since beyond a height of 50 our table 
is definitely not exhaustive and therefore useless. By November 2009, the project [LPS09] had 
resulted in an exhaustive search up to height 29.9337 (i.e., up toc = 10!%, apparently improved to 
10° [NO9]). Schulmeiss has found some very large abc sums that satisfy (6), the largest of which 
has a height of 5,114. Since these sums were not obtained by an exhaustive search, they are less 
useful to check different versions of the ABC Conjecture. 

5This criterion is closely related to the “merit”, see [GdS07, dS09]. See also (7) below, which 
contains the same information as an inequality for the merit. 
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Fig. 1 The points (h(P), h(P) — r(P)) and the graph of 6.07/h/logh. 


In Figure 1, the points (4(P),4(P) — r(P)) are compared with the graph of 
6.07V/h/logh. This diagram seems to indicate that h — r grows linearly in h for 
infinitely many abe sums, which would contradict the ABC Conjecture. However, 
only a small portion of the sums in the tables satisfy the inequality (6). We expect 
that as more abc sums become available that satisfy (6), it will become clear that 
h(P) — r(P) is never much larger than ,/h(P). Indeed, in [vF95], the author has 
given a heuristic argument showing that 


h(P)—r(P) « Vh(P)/logh(P) (7) 


for all abc sums P (see also [StTe]). 
Clearly, by (6), 
log(h(P) —r(P)) = 1 
logh(P) ~ 2 


lim su 
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Fig. 2. log( — r) compared with log h. 


In view of the heuristic inequality (7), we expect that actually, equality holds here.° 
The double logarithmic plot in Figure 2 of the points 


(1s np), REAP) = >) 


logh(P) 
suggests that the maximum of log(h — r)/logh is 0.733258, for the abe sum 


19 x 1307 +7 x 297 x 318 = 28 x 322 x 54, 


6 As alluded to in the introduction, the value 1/2 may be related to the Riemann Hypothesis. Michel 
Waldschmidt pointed out to me that the most accessible approach to such a connection may be to 
construct a sequence of abe sums such that h(P) — r(P) > h(P)°~®, given a hypothetical zero of 
the Riemann zeta function with real part 6 > 1/2. 
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of height 36.1524, logh = 3.3877. It is clear however that the available data is 
insufficient to reach a definite conclusion, and one expects more definite information 
to emerge from a complete table of abc examples up to a height of at least about 80. 


3 Baker’s alternative conjectures 


Let w(n) be the number of prime factors of the natural number n. By the 
prime number theorem, w(n) is at most of size logn/loglogn, and by [HW60, 
Theorem 430], w(1) ~ log logn on average. Clearly, for an abc sum a + b = c, the 
number of prime factors of abc equals w(abc) = w(a) + w(b) + w(c). In [B04], 
Baker proposes the following conjecture: 


Conjecture 2. There exists a constant K such that 
h(P)—r(P) < wlogr(P) —(@+ 1/2) loga +a+ K 
for all abc sums P:a + b = c composed of w = w(abc) different prime numbers. 


In Figure 3, we compare Baker’s conjecture with the available data from our 
tables. Clearly, the diagram indicates that the constant K in Conjecture 2 could 
probably be taken to be —0.8. 


Remark 1. By Chen’s theorem [C73, C78], there are infinitely many n and prime 
numbers p,g andr such that 2” = p + qr. Thus, w(abc) is bounded (by 4) for 
an infinite sequence of abc sums. In these examples, the radical r(P) = log(2pqr) 
is about 2h(P). On the other hand, for the examples in our table with h(P) > 
r(P), especially those that satisfy (6), w(abc) is always a relatively large value. 
Indeed, if Conjecture 2 holds, those abc sums are composed of at least w(abc) = 


O(Vh/(ogh)*/?) primes. 


Our formulation of Conjecture 2 is not Baker’s original formulation. Baker gives 
two formulations, one based on ©(R), the number of integers < R composed of 
prime factors of R. This is equivalent to Conjecture 2 (see [B96, §7]). Baker states 
this conjecture in an exponential form involving w!, where we have used Stirling’s 
formula to replace log w! by (w + 1/2) log w —w. Our formulation seems to exhibit 
more clearly the relative importance of the different terms. Ignoring all terms of 
lower order, one could rephrase the conjecture in a weaker form as follows: 


Conjecture 3. There exists a constant « such that for all abc sums P:a +b=c 
composed of w = w(abc) different prime numbers, 


h(P)—r(P) < ko log(r(P)/o). 


At present, it seems hopeless to prove the ABC Conjecture itself. It derives its 
interest mainly from the fact that it allows us to test other conjectures. However, 
Baker continues by stating his most interesting conjecture. It has the same strength 
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Fig. 3 The size of A = (h —r) — (a log(r) — (@ + 1/2) log(w) + a). 


as the ABC Conjecture in the sense that it implies Fermat’s Last Theorem, and it 
may be possible to prove it using the methods of linear forms in logarithms, as Baker 
explains in [B04, §2] and [B96]. 


Conjecture 4. There exists a constant « such that for all abc sums P:a +b =c, 
h(P) —r(P) < kw(ab) log(r(P)/a@(ab)). 


Remark 2. The estimate for linear forms in logarithms that is discussed in [B96, 
§4] involves a product of logarithmic heights. If one could obtain a similar estimate 
with the product replaced by a sum of the logarithmic heights, one could deduce 
Conjecture 4. The reader can find an example where a product is successfully 
replaced by a sum in the Remark on page 37 of [LC90]. 
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For the connection with linear forms in logarithms, it is essential that w(ab) in 
Conjecture 4 only depends on two of the variables. Baker does not specify on which 
two variables, so permuting a, b and c (and adjusting the signs), the weakest form 
of Conjecture 4 is obtained when w(ab) has the maximal value, 


w(ab) = @max = max{w(ab), w(bc), w(ac)}, 
and the strongest form is obtained for 
w(ab) = @min = min{w(ab), w(bc), w(ac)}. 


Clearly, Conjecture 4 implies Conjecture 3 with the same value for x, and 
Conjecture 4 with k = | implies Conjecture 2. On the other hand, for the weakest 
form of Conjecture 4, with @max for w(ab), we have’ 


3 
w(abc) < 7 Pmax ; 


so Conjecture 2 implies Conjecture 3 with k = 3/2 and w(ab) = max. 

In Figure 4, we graph (h — r)/ (min log(r/@min)). Thus one could probably take 
k = 2.04, even in the strongest form of Conjecture 4. The extreme example is the 
abc sum 


1+2°x3x5x7x 134 x 17 = 2394, 


of height 21.9058. 
Reasoning as before, each one of the conjectures 2, 3 and 4 imply Fermat’s Last 
Theorem. Indeed, let x” + y” = z". Then 
w(abc) = w(xyz) < 3logz/ log log z. 
Also w(ab) < 3 log z/ log log z. Using Conjecture 3 or 4, one obtains 
(n — 3) logz < 3k log zlog log log z/ log log z. 
It follows that n —3 < o(1), which implies, as before, that there remain only finitely 


many cases to check. And if « is explicitly known, this can be done in a finite 
search. 


TIf w(c) is the least value among w(a), w(b) and w(c), then @max = @(a) + @(b) and w(abc) = 


Omax + O(C) S Omax + 3(@(a) + 0()). 
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Fig. 4 h —r compared with @pin log(7/@min), plotted against the height. 
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Abstract Belyi’s Theorem states that every curve defined over the field of algebraic 
numbers admits a map to the projective line with at most three branch points. This 
paper describes a unifying framework, reaching across several different areas of 
mathematics, inside which Belyi’s Theorem can be understood. The paper explains 
connections between Belyi’s Theorem and (1) The arithmetic and modularity of 
elliptic curves, (2) abc-type problems and (3) moduli spaces of pointed curves. 
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1 Introduction 


Belyi’s Theorem, the main object of our study, states: 


Theorem 1.1 (Belyi [Bel80], [Bel02]). Let X bea connected, smooth, projective 
curve defined over the field of algebraic numbers Q. Then there exists a morphism 


g:X—P! (1.0.1) 


with 
Branch Locus (g) C {0, 1, co}. (1.0.2) 
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Belyi’s Theorem is an elementary result about curves defined over the algebraic 
numbers Q. One surprising aspect of Belyi’s Theorem is that not only is the 
statement of the theorem elementary, but so are the only two known proofs (both 
due to Belyi himself). Nothing but the rudiments of the theory of ramification for 
maps between curves is called upon. These notions were known to Riemann, yet 
Belyi’s Theorem was only discovered in 1979, more than a hundred years later. 

This paper describes a unifying framework, reaching across several different ar- 
eas of mathematics, to understand Belyi’s Theorem. The paper explains connections 
between Belyi’s Theorem and (1) the arithmetic and modularity of elliptic curves, 
(2) abc-type problems and (3) moduli spaces of pointed curves. 

First, §2 presents Belyi’s two different proofs of his theorem. In each proof we 
attempt to motivate Belyi’s key constructions. Next, §3 is devoted to the connection 
between Belyi’s Theorem and the abc Polynomial Theorem. §4 discusses theorems 
and conjectures that offer generalizations of Belyi’s theorem to characteristic p 
(§4.1), function fields (§4.2) and higher dimensions (§4.3). In Section 5, we give 
an exposition of a question of C. Khare, which is an attempt to relate the modularity 
of elliptic curves to Belyi’s Theorem. 

The first half of this paper (§§2,3) is completely elementary. The second half 
(§§4,5) uses a bit of algebraic geometry. Some knowledge of automorphic forms is 
helpful for Section 5. 


2 Belyi’s Proofs 


Section 2 is devoted to the presentation of Belyi’s two proofs of his theorem. For a 
map y : C; — C) of smooth, projective curves, we write B(¢~) for the branch locus 
of g and for a point P € C), we let e,(P) denote the ramification index of g at P. 
Given X as in Theorem 1.1, we call a map ¢ satisfying (1.0.1) and (1.0.2) a Belyi 
map for X. 


2.1 Outline of section 2 and the proofs 


In the interest of making the exposition self-contained, we begin by recording the 
well-known fact (Lemma 2.1 below) that ramification indices are multiplicative 
under a composition of morphisms. The basic structure of both of Belyi’s proofs 
is the same. The general strategy is to start with an arbitrary map hy : X —> P! 
and “refine” it by composing it with a sequence of maps from P! to itself, which 
gradually reduce the complexity of the branch locus, until it becomes {0, 1, co}, the 
simplest it could possibly be. The complexity of the branch locus is reduced in two 
steps, which we call the “Q to Q step” and the “Q to {0, 1,00} step”, stated and 
proved as Theorems 2.2 and 2.4 respectively. Belyi’s two proofs are the same as far 
as the “Q to Q step” is concerned. However the two proofs offer genuinely different 
approaches to the “Q to {0, 1, co} step”. 
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We present Belyi’s proofs in the opposite order from that in which they were 
historically discovered by Belyi. We do this for two reasons. First, we believe that, 
although it is more sophisticated and harder to understand at first, Belyi’s second 
proof (Proof 1 of Theorem 2.4 below) offers more insight and is more efficient (see 
also Question 2.7), whilst it is not less elementary than Belyi’s first proof (Proof 2 of 
Theorem 2.4 below). For example, Belyi’s second proof of the “Q to {0, 1, co} step” 
requires a single self-map of P', whereas the first proof requires (possibly) many 
such maps. Our second reason for the order of presentation is that Belyi’s second 
proof has gone almost unnoticed!, while the first proof from [Bel80] is well known 
and many expositions of it are available. To our knowledge, there is no reference 
containing Belyi’s second proof other than Belyi’s original paper [Bel02]. 

Since Belyi’s second proof uses Vandermonde determinants, we have included a 
subsection which briefly recalls the basic properties of Vandermonde determinants 
needed in the proof. 

Finally, section 2 ends with some further remarks about Belyi’s theorem and its 
proofs. 


2.2 The Two Proofs 


We shall make repeated use of the following well-known property of ramification 
indices, which we state as a lemma. 


Lemma 2.1 (Multiplicativity of Ramification Indices). Suppose 
? v 
Cy —+ C2 —> Cy (2.2.1) 
is a composition of maps of smooth projective curves. Then for all P € C,, one has 


Cyoo(P) = ey(p(P))eg(P). (2.2.2) 


Theorem 2.2 (Branching Reduction from P'(Q) to P'(Q)). Let ho : X > P! be 
a morphism satisfying B(ho) C P!(Q). Then there exists a morphism hz : P' > P! 
such that the composition 


hg oho: X — P' satisfies B(hg oho) C P'(Q). (2.2.3) 


Proof. We begin by recursively defining a sequence of polynomials and branch 
sets. For a finite set S C Q, we use the notation ms g(x) to denote the minimal 
polynomial of S' over Q i.e., m5 Q(x) is the monic polynomial of smallest degree 


‘Apparently one reason for this is that [Bel02] was an MPI preprint for several years that was 
difficult to access before it was published. 
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with rational coefficients having all the elements of S as roots. Note that since we 
do not require the elements of S' to be Galois conjugate, the minimal polynomial 
Ms,g(x) may be reducible. Put 


Bo = Bho) — {oo} , tix) = ma.g(), (2.2.4) 
and define recursively 
B; = Bth;) — {oo} , hi (x) = mp,_, Q(X). (2.2.5) 


Fori > 1, one can think of the h; as maps h; : P! +P, totally ramified at oo. 


Claim 1: For alli > 1, if degh; > 1, then degh;4,; < degh; — 1. The point here 
is that the branch sets B; are Galois stable fori > 1 ie., for allo € Gal(Q/ Q) 
anda € B; one has oa € 8B;. Indeed, for all i > 1 the finite branch sets B; can be 
described by means of derivatives as 


Ifa € B;, say a = h;(a), then 
oa = o(h;(a)) = h;(oa) 


since h; € Q[x] and 


hi (oa) = o(hi(a)) = 0(0) = 0 


since hi € Q[x] and h;(a) € B;. 
Since B; is Galois stable, we have 


hisi(x) = [] (-a) (2.2.7) 


aeB; 


for all 7. In particular deg ; +, = | B;|. From (2.2.6) we see that if degh; > 1, then 
|B; | < deg h; — 1, so the claim is true. 

By the claim, there is a positive integer € such that /y is linear, ie., deghy = 1. 
Consider 


hp = hey ohy_.0++-0 hy (2.2.8) 
and puth = hg oho. 


Claim 2: B(h) c P'(Q). To see this, we use the multiplicativity of ramification 
indices (Lemma 2.1). Suppose h is ramified at P € X above Q € P! and Q 4 ow. 
By Lemma 2.1, there exists 7,0 <i < €—1, such that 


Cn; ((hi-1 ohj_» ooh, 0 ho)(P)) >1. (2.2.9) 
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Case I: i < €—1. By definition of B;, (2.2.9) implies that 
(h; ohj-; 0---oh, oho)(P) € B;, (2.2.10) 


so 
(hi41 Ohj o-+- 0h, ohp)(P) = 0. (2.2.11) 


Since h; € Q[x] for alli > 1, it follows that 


(he-| os ohj44 oh; o+--ohy, 0 ho)(P) = h(P) = Q € Q, (2.2.12) 
as desired. 
Case II: i = € —1. Since hy is linear and he(x) = ma_p,_,(x), we must have 
|Be_1| = 1 and Be_; C Q. But O € By_; so €Q. oO 


As an interlude between the two steps of Belyi’s proofs, we recall some basic 
facts about Vandermonde determinants. 


Vandermonde determinants. Let K be a field. Given a),...,a@, © K, the 
Vandermonde determinant of @1,...,Q, 18 
to a a. 2? 
Ty 2! ee 
V(q,...,Q@,) =]... P ; . ; (2.2.13) 
1 a o ... qt! 


n n 


The Vandermonde determinant satisfies Vandermonde’s Identity: 


Lemma 2.3. One has 


V(m,...,) =] [(@ —4;). (2.2.14) 


j<i 


Proof. The proof is by induction on n. Observe that (2.2.14) holds forn = 2, as 
both sides of the equation are equal to a — a. We may assume that a; ¢ a; for 
i # j since otherwise both sides of (2.2.14) are 0. 

Define a polynomial v(t) of one variable ¢ by v(t) = V(q,...,@,—1,¢). The 
degree of v(t) is n — 1. Let c be the leading coefficient of v(t). By expanding the 
determinant defining v(t) along the bottom row, we see that c = V(q),...,Qn-1). 
Now v(q@;) = 0 for 1 <i <n-—1, since the determinant giving V(q,..., @,—1, @%) 
has the ith and nth rows equal. Hence t—a; divides v(t) and since the a; are distinct, 
we obtain 


n—-1 


v(t) =c | [(t- a2). (2.2.15) 


i=1 
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Using induction to express c as a product, (2.2.15) becomes 


n—-1 


w= [] @j-o%)[[@-a). (2.2.16) 
1<k<j<n-1 i=1 
Substituting a, into (2.2.16) gives (2.2.14). oO 


The Vandermonde determinant also satisfies a scalar translation invariance 
property: For all 6 € K, translating all the entries a1,...,a@, by 6 does not affect 
the Vandermonde determinant. More precisely, 


Via, + B,...,Q@, + B) = V(qy,..., Qn). (2.2.17) 


This translation invariance is an immediate consequence of the Vandermonde 
Identity (2.2.14), since each term in the factorization of the left-hand side of (2.2.17) 
is of the form (a; + 8)—(a@; +8) = a; —a;. The translation invariance (2.2.17) can 
also be seen directly from the definition of the Vandermonde determinant, without 
using the Vandermonde identity, by means of the multilinearity of the determinant. 
We will make use of the translation invariance property of the Vandermonde 
determinant at the end of the first proof of Theorem 2.4. 
We now come to the second part of Belyi’s proofs, the “Q to {0, 1, 00} step”. 


Theorem 2.4 (Branching Reduction from P! (Q) to {0, 1, 00}). Assumeh: X > 
P! is a morphism with B(h) C P'(Q). Then there exists a map g : P! > P! such 
that the composite g 0h is a Belyi map for X. 


Before embarking on the proofs of this theorem, let us note the following 
equivalent formulation. Suppose g : P! — P! and put S = B(h). By the 
multiplicativity of ramification indices, we see that g o h is a Belyi map for X if 
and only if g satisfies the two conditions: 


(i) g(S) C {0, 1, co} 
(ii) B(g) C {0, 1, 00}. 


First proof of Theorem 2.4: This is Belyi’s second construction, which uses 
Vandermonde determinants. We proceed to construct a function g : P! > P! 
satisfying conditions (i) and (ii). Let S = B(h) C P!(Q). By composing with a 
fractional linear transformation (which is everywhere unramified) we may assume 


S = {Ay <Aq < +++ < An} U {00} (2.2.18) 
with A; € Zand aA, = 0. 
Consider 
g(x) =] [@-aiy”" (2.2.19) 
i=l 
with 


Me = IAA TV Oysters dd) (2.2.20) 
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where the term with a hat GO.) is to be omitted. By choosing g to have the above 
form, we immediately ensure that g satisfies condition (i), regardless of our choice 
of exponents m;7. In fact, g maps A; to 0 or oo according to whether m; is positive 
or negative and the fact that g is monic implies that g(oo) is either 0, 1, or co 
depending on whether the degree of g is negative, zero, or positive, respectively. 
Although it is not necessary for the proof, we will determine which of the above 
cases occurs for each element of S. 

The purpose of our particular choice of exponents m; made in (2.2.20) is to 
ensure that g satisfy (ii) as well. We shall try to motivate the choice of exponents 
later in the proof. 

Before proving (ii), we determine explicitly the values of g on elements of S. By 
Vandermonde’s Identity (2.2.14) and the fact that the 4; are increasing, we have that 


~ 


V(Aq,...,Aj,.-.,An) > 0 (2.2.21) 
for all i since every term in the factorization of V(A,,... ae. ...,An) is positive. 
Hence: 

eO5= if i =1 (mod 2) (2.2.22) 


if i =O (mod 2). 


Next we show )~’_, mj; = 0, which gives g(oo) = 1. Consider the auxiliary matrix 


11 ay a2... an 
11 A, a2... an? 

A=|. 00 OO (2.2.23) 
1 1 A, Az... Ane? 


Since the first two columns of A are identical, det A = 0. Expanding det A along 
the first column gives )°/_, mj = 0. 
Now we return to verifying (ii) for g. Consider the logarithmic derivative of g: 


n 


‘ mj 
- => (2.2.24) 


j=" 


Since the ramification points of g that are not above infinity are the roots of g’, the 
ramification points of g are contained in the set of zeros and poles of g’/g. 


Claim. We have the identity 


” i ms pees An 
3 IM (ryt And (2.2.25) 
L4+x—j; x i 


>This is true so long as m; # 0 for all i which holds by (2.2.14) and (2.2.18). 
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The important point is that the numerator on the right-hand side of (2.2.25) is a 
constant; its value does not matter for proving (ii). The choice of exponents that 
we made in (2.2.20) is what makes the numerator of the right-hand side of (2.2.25) 
constant; for an arbitrary choice of exponents, the numerator of the right hand side 
of (2.2.25) would be a complicated polynomial of degree n — 1. 

Before proving the claim, let us see why it implies that g satisfies (ii). Assuming 
the claim, we see that g’/g has simple poles at ;, a zero of order n at oo and no 
other zeros or poles. Hence any ramification point of g either lies above oo or lies 
in S. By (i) for g, the branch points of g are contained in {0, 1, co}, as was to be 
shown. 

We are left with proving the claim. By putting the left-hand side of (2.2.25) over 
a common denominator, the claim is equivalent to 


Yim | [@-4) = CD" 'VAL...,An)- (2.2.26) 
i=1 Hi 


We shall prove (2.2.26) in two steps: First we will show that the constant term of the 
left hand side is equal to the right-hand side and then we will prove the full identity 
by using a change of variables trick suggested by B. Mazur. Equality of constant 
terms in (2.2.26) states that 


Yom (I [Jay = CD" VAL... An). (2.2.27) 


i=l j#i 
We now use that A; = 0 to simplify (2.2.27). Ifi 4 1, then ILi«i A; = 0 since 
A; = 0,80 (2.2.27) becomes 
m | [ A; = V(u,...,An). (2.2.28) 
j#l 


Keeping in mind that 4; = 0, expand V(Ay,...,A,) along the first row: All but the 
first term are zero, so we get 


he. Fe sks. a 
Visayas £ t-2 | (2.2.29) 
Ree De ai 
te cox DE 
= [Agi 2 Se 3 (2.2.30) 
gee NN Ne. eg AE 


by multilinearity of the determinant. This proves (2.2.27). 
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We regard (2.2.27) as an identity of independent variables {A;}° and make the 
change of variables 


pa 2 ee (2.2.31) 
to get 
YEDIV x $A FAG et An) 
i=1 
] [Gx +4) = Vex + Ai,...,—x + An). (2.2.32) 


j#i 


We are now in a position to apply the translation invariance property of the 
Vandermonde determinant explained above. Applying (2.2.17) to (2.2.32) gives 


n 


SSCDITV OAL. Ais An) Pay) = EDV AL, An). (2.2.33) 


i=l i#i 
This proves the claim which completes the proof of Belyi’s theorem. oO 


Second proof of Theorem 2.4: This is Belyi’s original construction. The idea here 
is to reduce the size of the branch locus by one at a time, until the size of the 
branch locus is at most 3. Again set S = B(h) C P!(Q). We first show by 
induction on || that it suffices to construct g satisfying (i) and (ii) when |S| = 4. 
The crux of the argument is then to treat the case |S| = 4 by constructing g 
explicitly. 

Suppose there exists g satisfying (i) and (ii) for all S with |S| = 4. Let 
s € S and set U = S —s. By induction there exists a function gy : P! > P! 
satisfying (i) and (ii) for U ie, gy(U) C {0,1, 00} and B(gy) C {0,1, co}. Let 
T = {0, 1,00, gy(s)}. By our assumption on |S| = 4, there exists gr : P! > P! 
satisfying (i) and (ii) for T ie., gr(T) C {0,1,00} and B(gr) C {0, 1,00}. Now 
consider gs = gr © gy. By (i) for gr and gy, we have gs(S) C {0, 1,00} and we 
see from the multiplicativity of ramification indices that B(gs) C {0, 1,00}. This 
completes the induction. 

We now show that there exists g satisfying (i) and (ii) when |$| = 4. By using a 
fractional linear transformation, we may assume S' = {0, 1,00, 5} withO < 5 < 1. 


Note that s € Q. There exist unique positive integers m,n such that s = a and 
(m,n) = 1. We claim that 
m+n m+n 
g(x) = ee Gi — x)" (2.2.34) 
mn" 


3In other words, regard (2.2.27) as an identity in Q[A,,..., An). 


190 W. Goldring 


satisfies (i) and (ii). We will try to motivate this construction and in the process 
prove that it works. 

To begin with, it makes sense to consider functions given by polynomials of the 
form 


fag) =o =x)", (2.2.35) 


with c € Q a non-zero constant, as possible candidates for g. For all c 4 0 and 
k,€> 1, wehave f.xe(0) = foxe() = Oand f.x,¢(0o) = oo. For any choice of 
k, £, there will be a unique choice of c to make f...¢(s) = 1. This takes care of (i). 
As for (ii), the finite branch values of f.x,¢ are given by 


B( fe.) — {00} = {fe ke @| fen e(@ = 0}. (2.2.36) 
But 
foe) = ex* 1 (1 = x)" [kL — x) — €5], (22:37) 
so he has just one zero other than zero and one. The value of this exceptional 
zero depends on the exponents k, £ and is given by x = oy Hence we choose 
k = mand € = nso that the root of f/’, , different from 0 and | is [7 = s. The 
corresponding branch value is then f..m.,(s). As was said above, there is a choice 
of c, namely c = Ce a so that fomn(s) = 1. This is one way to see how to 
arrive at 
m+n 
g(x) = fere(x) with c = ee a k =mandl=n., (2.2.38) 
m™n" 


oO 


2.3. Remarks 


The most interesting point in both of Belyi’s proofs of Theorem 2.4 is that the 
exponents in the functions g satisfying (i) and (ii) are constructed from the elements 
of S, which are the branch values of h. These constructions use the fact that the 
branch values of / are integers (or rational numbers) together with the very special 
property of the ring Z that its elements are exponents of rational functions. By 
the latter we mean that, for example, f(x) = x*® is a rational function if and 
only if a € Z. This presents a major difficulty in generalizing Belyi’s theorem 
to other situations, such as the function field case, which we discuss in the next 
section. 
The converse of Belyi’s theorem is also true: 


Theorem 2.5 (Weil [Wei56], Converse of Belyi’s theorem). Suppose X is a 
connected, smooth, projective curve that admits a map p : X > P! with B(g) C 
{0, 1, oo}. Then X is defined over Q. 
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The converse of Belyi’s theorem follows from Weil’s descent theory. It is curious 
that, as far as we know, Weil did not ask whether the converse to Theorem 2.5 is 
true, thus conjecturing Belyi’s theorem, especially that Weil did so much pioneering 
work in the arithmetic and geometry of covers of curves. 

Theorem 2.5 is more clearly understood in terms of descent for the étale 
fundamental group, (part of) Grothendieck’s vast generalization of Weil’s theory, 
which can be found in [Gro71]. 


Theorem 2.6 (Descent for the étale fundamental group). Suppose L is an 
algebraically closed field of characteristic 0 and K is an algebraically closed 
subfield of L. Let S be a variety over K. Then 


n"(S) ~ "(5 @x L). (2.3.1) 


Define the Belyi degree of a curve X defined over Q to be the smallest degree 
of a Belyi map for X. Both proofs of Belyi’s Theorem give a bound for the Belyi 
degree of X in terms of the height of the branch values of the original function ho. 
The first proof presented here (Belyi’s second proof) is more efficient in the sense 
that it gives a better bound on the Belyi degree. 


Question 2.7. What can be said about the behavior of the function 


. connected, smooth, projective 
Belyi D 2 a yA 2.3.2 
ee | curves defined over Q = ( ) 
For example, how close is the bound on the Belyi degree given by Belyi’s second 
proof to the best possible? 


We know of only one lower bound for the Belyi degree, which was kindly 
communicated to us by Zapponi. To state his result, we need the notions of field 
of moduli and semistable reduction. Suppose X is a curve defined over Q. The 
Galois group Gal(Q/Q) acts on the set of isomorphism classes of curves defined 
over Q, and the stabilizer of the isomorphism class of X in Gal(Q/Q) is of the form 
Gal(Q/ K) for some number field K. The field of moduli of X is defined to be K. 
The field of moduli may also be described as being the intersection of all fields of 
definition for X.. The field of moduli of other geometric objects defined over Q, such 
asacover f : X —> Y of varieties, is defined analogously. The field of moduli is 
not always a field of definition, but in many cases it is, and understanding precisely 
when it is is a rather subtle problem, see [DD97]. 

Suppose X is a curve with field of moduli a number field K. Let p be a prime of 
K and let Ox, denote the ring of integers of the completion of K at p. We say that 
X has good (resp. semistable) reduction at p, if X admits a model over Spec Ox, 
whose special fiber is smooth (resp. a model whose special fiber has at worst nodal 
singularities and finitely many automorphisms). If X does not have good reduction 
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at p, then we say X has bad reduction at p. We say X has good (resp. semistable) 
reduction over a prime p € Zif X has good (resp. semistable) reduction at all 
primes of K lying above p. Finally, there is the question of how the reduction type 
of X changes as the base field varies. We shall use only that good and semistable 
reduction do not change if one passes to a larger field.* 

Again these definitions extend to covers. However it is important to note that, 
for example, X may have good reduction at a prime p of a field F, while a cover 
consisting of X and a map from X to P!, may have bad reduction at p. 


Theorem 2.8 (Zapponi, [Zap08a]). Let X be a curve defined over Q, with field of 
moduli K. Then the Belyi degree of X is greater or equal to the largest prime p € Z 
over which X has semistable bad reduction. 


Note added in Proof: This result can now be found in a preprint of Zapponi, “On 
the Belyi degree(s) of a curve defined over a number field”, available at http://www. 
math.jussieu.fr/~zapponi. 

The lower bound of Theorem 2.8 cannot be sharp since, for example, there exists 
infinitely many number fields F and elliptic curves E'/ F with E having everywhere 
good reduction over F. 

The proof of Theorem 2.8 is a simple application of a theorem of Beckmann 
[Bec89] relating the primes of bad reduction of X to the Galois groups of Belyi 
maps for X. In turn, Beckmann’s theorem may be obtained as a corollary of 
Grothendieck’s theory of the tame fundamental group. For Grothendieck’s theory 
of the tame fundamental group, see [Ser92] and [Gro71]. 


Theorem 2.9 (Beckmann, [Bec89]). Assume (X, ¢) is a pair consisting of a curve 
X and a Belyi map ¢ for X. Let M be the field of moduli of the pair (X,@). Let 
G be the Galois group of the Galois closure of the cover p : X —> P'. For every 
prime p € Z, if p does not divide |G|, then X has good reduction at primes of M 
above p. Moreover p is unramified in M. 


We do not give a proof of Beckmann’s Theorem, as that would take us too far 
afield. 


Proof of Theorem 2.8. Let g be a Belyi map for X of degree n. Let K (resp. M) 
be the field of moduli of X (resp. (X, ¢)). Then M is a (possibly non-trivial) finite 
extension of K. Let G be as in Theorem 2.9 and let p be a prime of K, above p € Z, 
where X has bad semistable reduction. Then X also has bad semistable reduction 
at some prime go of M, lying above p. Hence the pair (X, ~) has bad reduction at 
go. By Theorem 2.9, p divides |G|. But G <> S,,, the symmetric group on n letters. 
Now |S,,| = 1!, so |G||n!. Hence p|n!, so p <n. Oo 


4The deep result here, which we do not use, is the Semistable Reduction Theorem of Deligne- 
Mumford, which says that every curve acquires semistable reduction over some finite extension 
of the base field (see [DM69]). However this result is the reason why in Theorem 2.8 we say 
“semistable bad reduction” rather than simply “bad reduction”, which would have been a stronger 
statement. 
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3 The abc Polynomial Theorem 


Let K be an algebraically closed field of characteristic 0. Let f € K[x]. Then the 
radical R(f) of f is the product of the distinct irreducible factors of f. In other 


words 
R= |e (3.0.3) 
pif 
We denote the degree of the radical of f by r(/), ie., deg R(f) = r(f). The 
height h(fi,..., fr) of a finite set {fi,..., fn} C K[x] of polynomials is the 
maximum of the degrees: 


h(fi,.--, fn) = max{deg fi,...,deg fi}. (3.0.4) 


The abc Polynomial Theorem states:> 


Theorem 3.1 (Stothers [Sto81]-Mason [Mas84]). Suppose e, f, g € K|x] are not 
all constant, pairwise relatively prime and satisfy 


e+ f=g. (3.0.5) 


Then 
h(e, fig) <r(efg)—-1. (3.0.6) 


Theorem 3.2. Equality holds in (3.0.6) if and only if f/g is a Belyi map for P' and 
(f/8)(c0) € {0, 1, oo}. 


Proof. Seth = h(e, f,g) andg = f/g. Then deg gy = h. Further, we may assume 
without loss of generality that dege = deg f = h. There are then two cases, 
deg g < h or degg = h, which correspond to whether g(oco) = oo or not. Our 
assumptions on the degrees of e, f, g imply that in both cases y(oo) # 0, 1. By the 
Riemann—Hurwitz formula applied to ¢, 


~2=-2h+ Y (eg(x) — 1). (3.0.7) 


xep! 


The crucial observation now is that the branching behavior ¢ over {0, 1, co} is tied 
to the radical of efg. 
We break up the ramification divisor 


R= > (x) -1) (3.0.8) 


xeEP! 


>So called because it is the function field analogue which motivated the famous ‘abc Conjecture’ 
of Masser and Oesterlé (see [Oes89]). 
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as R = Ry 1,60} + Rpi_—10,1,00} with 


Roicy= >) (e(x)—1) (3.0.9) 
x€g—!({0,1,00}) 
and 
Rpi—0,1,00} = > ez) - 0. (3.0.10) 


x€g—!(P!—{0,1,00}) 
Now we focus on Ry9,1,99}. We have 


Y> @m-D= Yo e(@- Yo 1 (3.0.11) 


xeg—!(0) xe€g—!(0) xeEg—!(0) 
=h—|g™'(0)| =h-r(f). 


Similarly, since g(x) = 1 if and only if (g — f)(x) = e(x) = 0, we have 


Y> (p(x) — 1) =h-r(e). (3.0.12) 


xeg!(1) 


If g(x) = oo, then either x = oo or g(x) = 0. If deg g = h, then g(co) # ov, so 
as above 


Y> (g(x) — 1) =h=r(g). (3.0.13) 


x€g—!(o0) 


On the other hand, if deg g < h, then g(oo) = 00, so we have 


Y> (x) -1) =h-((g) + D. (3.0.14) 
x€y—! (oo) 
The extra contribution of | this time comes from x = oo, namely in this case 


|p~'(co)| = r(g) + 1. 
Putting together these calculations, we see that 


3h —r(e)—r(f)-r(g) if degg=h (3.0.15) 


Reoc0} = 
fetes 3h—r(e)—r(f)—r(g)—1 if degg <h 


By (3.0.7), 2h —-2=R and Rpi_o,1,90} = 0, 80 2h —2 > Rpo,1,00}. Hence (3.0.15) 
implies that 2h — 2 > 3h —r(e)—r(f)—r(g)—1,s0 


h<r(e)t+r(f)+r(g)-1. 
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Since e, f,g are pairwise relatively prime, r(efg) = r(e) + r(f) + r(g). 
This proves (3.0.6). We also see that equality holds in (3.0.6) if and only if deg g <h 
and Rpi_40,1,00} = 0. Now deg g < h means that g(oo) = 00 and Rpi_ 11,50} = 0 
means that ¢ is a Belyi map. This proves Theorem 3.2. oO 


4 Generalizations 


In this section, we present some theorems and conjectures which may be viewed 
as generalizations of Belyi’s Theorem in several different directions. In §4.1, two 
analogues of Belyi’s Theorem in characteristic p are given, which we call “The wild 
p-Belyi Theorem” (Theorem 4.1 ) and “The tame p-Belyi Theorem” (Theorem 4.6). 
It is interesting to note that, regarding the tame p-Belyi question, it is only a 
theorem for p > 2, while it remains an open problem for p = 2. Then §4.2 states 
Question 4.12, which is concerned with generalizing Belyi’s Theorem to function 
fields. Finally, §4.3 considers Belyi’s Theorem in the framework of moduli spaces 
of pointed curves. This is used to explain Braungardt’s Question (Question 4.17), 
which offers a generalization of Belyi’s Theorem to higher dimensions. 


4.1 Characteristic p 


Let k be a (not necessarily finitely generated) perfect field of characteristic p > 0 
and let g : C} —> C2 be a map of smooth projective curves defined over k. Recall 
that there is the dichotomy of tame and wild ramification in characteristic p. Given 
P € C,, the map ¢ is tamely ramified (resp.wildly ramified) at P if p Je,(P) (resp. 
p\ey(P)). For a divisor D ona curve C, the reduced divisor associated to D (resp. 
the support of D) will be denoted by Drea (resp. supp(D)). We let D = Do — Doo 
be the unique decomposition of D as a difference of disjoint effective divisors. 

We now state the first of two characteristic p analogues of Belyi’s Theorem, 
which we call “The Wild p-Belyi Theorem”. We give two proofs of this result, 
following [Kat88], [Zap08b]. See also [Kat88] for applications of the “Wild p-Belyi 
Theorem” to the Langlands correspondence for function fields. 


Theorem 4.1 (Wild p-Belyi, [Kat88], [Zap08b]). Let C/k be aconnected, smooth, 
projective curve. Then there exists a map 


gp: C —>P' with B(y) = {ov}. (4.1.1) 


In other words, C is birational to an étale cover of the affine line A‘. 


In case a map ¢ satisfies (4.1.1), we call g a wild p-Belyi map of C. The tame 
fundamental group of the affine line A! is trivial. Hence, if C is not isomorphic to 
P!, then any wild p-Belyi map of C is wildly ramified over 00, whence the name 
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“wild p-Belyi map”. On the other hand, this is one of the major differences between 
characteristic 0 and characteristic p —the full étale fundamental group of the affine 
line is a huge, non-finitely generated group. This may seem counter-intuitive at least 
at a superficial level, given our geometric intuition concerning A!(C). The Wild 
p-Belyi Theorem is one manifestation of the fact that, indeed, there are “many” étale 
covers of the affine line in characteristic p. The study of the (full) fundamental group 
of the affine line forms an interesting subject in its own right. One of the deepest 
conjectures in this area was Abhyankar’s Conjecture [Abh57], which was proved by 
Raynaud [Ray94] using many sophisticated techniques from algebraic geometry. 
For an introduction to the basic problems regarding nal) and Abhyankar’s 
Conjecture in particular, see the Bourbaki exposé of Serre [Ser92].° 


Proof I (after [Kat88]). The idea here is to use that, given a finite subgroup G of the 
k-points of the additive group scheme Gy, the natural action of G on A! satisfies: 


1L A/G SAL 
2. The projection map z : A' —> A'/G isa finite étale cover. 


3. Over k, when G is viewed as a subset of A!, the restriction of a to the 
complement A! — G, 


7|(41~G) : (A! ~ G) —=—- Gin. 


is a finite étale cover of Gj. 


By the Riemann-Roch Theorem, there exists a non-constant function f € K(C) 
which has only simple poles.’ Now f defines a map f : C —> P! which is 
unramified above oo by construction. Hence f is étale over A! — B(f). Let G be 
the subgroup of G,(k) generated by B(f). Now we use that char(k) = p > 0, to 
attain that G is finite, since G = spany ,{B (f)}. It follows from (1-3) above that 
the composite (7 o f) : C —> P! is étale over G,,. The following lemma allows 
us to conclude the proof. 


Lemma 4.2. The map 
Ht? Gm —> A (4.1.2) 
Pie OP +) 
is a finite étale cover of the affine line by the multiplicative group scheme Gy. 
Proof. The lemma follows from a simple computation that is left to the reader. O 


Indeed, (uo x 0 f) : C —> P' is étale over A!, which proves Theorem 4.1. Oo 


The only downside to [Ser92] is that it was written before Raynaud [Ray94] proved Abhyankar’s 
conjecture, so it is a bit dated and does not include the most recent developments in the field. 

fn >g+ land P,,..., P,, are distinct points on C, then the divisor D = P, + --: P,, satisfies 
{(D) = deg D—g+1+€(K — D) > deg D — g + 1 > 2 by Riemann-Roch, so one can find 
the desired function in £(D) = H°(C, Oc (D)). 
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Proof 2 (after [ZapOSb]). Zapponi obtains Theorem 4.1 as a consequence of a more 
general result relating the existence of maps to P! with restricted ramification to the 
existence of exact rational differentials with restricted divisor type. Zapponi’s result 
is the following: 


Theorem 4.3. Let S be a non-empty finite subset of C. Then there exists a finite 
étale map (C — S) —> A! if and only if there exists an exact rational differential w 
whose divisor’s support is contained in S. 


One source of motivation for such a connection to exist is the following simple 
fact. 
Lemma 4.4. Suppose C is a connected, smooth, projective curve and f € K(C)— 
K(C)? is a rational function that is not a p-th power. Then the ramification divisor 
R(f) of the map f : g : C —> P! defined by f is related to the divisor of the 
differential df by 


Theorem 4.1 follows immediately from Theorem 4.3. Indeed, let f €¢ K(C) bea 
rational function that is not a pth power, i.e., f ¢ K(C)?. Putw = df and let S be 
the support of w. By construction w is a non-zero exact differential whose support is 
contained in S. By Theorem 4.3, there exists a finite étale map @ : (X¥ —S) —> Al. 
Now @ extends to a map 9 satisfying (4.1.1). 

To complete Zapponi’s proof of Theorem 4.1, it remains to prove Theorem 4.3. 
Following Zapponi, we first prove a general lemma about exact differentials on 
curves and then use the lemma to finish off the proof of Theorem 4.3. 


Lemma 4.5. Let w € On be a non-zero exact rational differential. Then for all 
P €C there exists a function g € K(C), satisfying dg = w, whose set of poles is 
contained in the union of { P} and the set of poles of w. 


Proof. Let T be the union of {P} and the set of poles of w. Since w is exact and 
non-zero, there exists gy € K(C) — K(C)? with dgo = w. The argument is now to 
show, by induction on the degree of the part of (g0)oo disjoint from 7, that one can 
modify go by h € K(C)? so that go + h satisfies the theorem. Note that char(k) = 
char(K(C)) = p implies dh = 0 for all hh € K(C)?. 

Let Z be the set of poles of go. If Z C T, there is nothing to prove. So assume 
there exists Q € Z —T. Putn = —ordg(go), son € Z*. Now n is divisible by 
P, Since otherwise w would have a pole at Q, contrary to O ¢ T. Write n = pm, 
with m € Z* and consider the divisors D = (2g —1)P +m@Q and D — Q. Both D 
and D — Q are non-special, since deg(D — Q) = [((2g — 1) +m) — 1] > 2g —2. 
That is (kc — D) = &(kc — (D — Q)) = 0, with kc the canonical class of C. 
Applying the Riemann—Roch Theorem to D and D — Q and comparing the results 
yields ((D) —£(D — Q) = 1. Hence there exists u ¢ £(D) — L(D — Q). Therefore 
there is a non-zero constant c € k* such that —ordg(go — cu”) <n. Put ho = cu? 
and g; = go — Mo. Then hy € K(C)? and dg; = dg = w. Furthermore,the degree 
of the part of (g1)oo disjoint from T is strictly smaller than the degree of the part of 
(Zo)oo disjoint from 7. By induction, there exist and g as desired. o 
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Proof of Theorem 4.3. (<==). Suppose fo € K(C) — K(C)? and supp(dfo) C S. 
Put w = dfo. Choose P ¢€ S and let T be as in the proof of Lemma 4.5. By 
Lemma 4.5, there exists fp € K(C) — K(C)? such that dfp = w and fp is regular 
outside T. We now modify fp to get a primitive of w which has a pole at every 
element of S (fp may be regular at some point(s) of 5). The map to P! determined 
by this primitive will be the map we are looking for. 

Let m be a positive integer and consider the divisor Dj = mP + Yoe(s—P) Q. 
As long as m => 2g + 1, the Riemann—Roch Theorem guarantees that there exists 
a function g such that (g) = D2 — Dj, with D» an effective divisor. For c € k*, 
consider the functions he = fp + cg?. For all O € S, g? has a pole at Q, so 
there is at most one value of c for which h, is regular at Q. Since S is finite and k* 
is infinite, there exists c € k* such that h, has a pole at every element of S. Also, 
dh, = w. Now think of h, asa map h, : C —> P!. By construction, h.(S) = {oo}, 
so h, is étale over A! by Lemma 4.4. 

(=>). Conversely, assume f : (X — S) —> Al! is finite étale. Let ¢ be the 
usual uniformizing parameter on A!, so that (t) = (0) — (©) on P!. Let w be the 
differential of the pull back of ¢ from P! to X by means of /, i.e., @ = d(f*(t)). 
By construction @ is exact. Since differentiation commutes with pull backs, @ = 
f* (dt). Hence supp(w) C f*(supp(dt)) = f*({oo}). So @ satisfies the properties 
required by Theorem 4.3. oO 


Having proved Theorem 4.3, the proof of Theorem 4.1 is now complete. oO 
We now come to the tame p-Belyi Theorem. 


Theorem 4.6 (Tame p-Belyi, [Sai97]). Let p be an odd prime. Let X be con- 
nected, smooth, projective curve defined over F,. Then there exists a tamely ramified 
morphism 

g:X—P! (4.1.4) 


with 
B(g) C {0, 1, oo}. (4.1.5) 


If g is a tamely ramified morphism that satisfies (4.1.4) and (4.1.5), we call g a 
tame p-Belyi map of C. 

Saidi’s proof amounts to the observation that Theorem 4.6 is a corollary of 
the following lemma, which was known to F. Klein and A. Hurwitz, but was first 
rigorously proved in all characteristics different from two by Fulton [Ful69]. 


Lemma 4.7 (Fulton [Ful69], p.569, Prop. 8.1). Let p be an odd prime and let k 
be an algebraically closed field of characteristic p. Then every connected, smooth, 
projective curve X over k admits a simply branched morphism to P', i.e., there 
exists 


vw: X — P! with ey(P) = 1 or2 forall P € X. (4.1.6) 


Proof of Theorem 4.6. Take k = Fp Let y : X —> P! be a simply branched 
cover, as provided by Lemma 4.7. Since p is odd, y is tamely ramified. Also, B(y) 


Unifying themes suggested by Belyi’s Theorem 199 


is a finite subset of P'(F,,), so there exists an integer m such that B(W) C Fm. 
Consider the function f given by 


ae a 


freee, 


and form the composite g = f o w. Now ff is totally ramified at 0 above 0, at 
oo above oo and unramified everywhere else. Since es (0) = e¢(oo) = deg f = 
p’” —1, the map f is tame. Also, f(b) = 1 for all b € B. By Lemma 2.1, g 
satisfies (4.1.5). oO 


The above proof is not applicable in the case p = 2, but the result should still 
hold for p = 2. 


Conjecture 4.8 (Tame 2-Belyi). Theorem 4.6 is also true for p = 2. 


There is only a partial result known in the direction of Conjecture 4.8 (see 
Theorem 4.10 below). It has been asked whether the proof of Theorem 4.6 could 
be modified so as to work in characteristic two by using triple ramification instead 
of double (simple) ramification. More precisely: 


Question 4.9. Suppose k is an algebraically closed field of characteristic different 
from three. Does every curve X over k admit a map 


1:X —P'! with e,(P) = 1 or3 forall P € X ? (4.1.7) 


Note that, in case char(k) = 2, if a map T satisfying (4.1.7) exists for a curve X, 
then t is tamely ramified, and so, by the proof of Theorem 4.6, Conjecture 4.8 holds 
for X. Thus a positive answer to Question 4.9 would imply Conjecture 4.8. 

In this direction, Schroer [Sch03] has given a lower bound for the dimension of 
the locus, in the moduli space M., of curves of genus g, of those curves admitting 
a map T as in (4.1.7). 


Theorem 4.10 (Schroer [Sch03]). Let k be an algebraically closed field of char- 
acteristic # 3. Define Tri(M.,) to be the closure of the locus of curves X that admit 
amap t as in (4.1.7). Then, 


1. dimTri(M,,) = 1. 
2. dim Tri(M,) > 2g — 3 for g > 2. 


As a final piece of evidence to support Question 4.9, we mention the following 
result regarding the case k = C. 


Theorem 4.11 (Artebani-Pirola, [AP04], p.339, Th. 2). The Locus Tri(M.,)(C) 
is a Zariski open subset of M,(C) 


The proof of Artebani—Pirola uses complex analysis, so it is not clear whether 
their method can be modified to prove the same statement for any field k, char k 4 3. 
Still, we believe that Theorem 4.11 shows at least that Question 4.9 is reasonable. 
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In this section, we search for a Belyi-type theorem for curves defined over Q(t). 
Let X be a curve defined over Q(t). By the converse to Belyi’s Theorem, if X is 
generic, then there is no map gy : X — P! with B(~) C {0,1, co}. By Belyi’s 
Theorem, there therefore cannot even be such a map with B(y) C Q. However one 
can ask: 


Question 4.12. For every curve X defined over Q(t), does there exist a map 
gy: X — P' with |B(g)| = 4? (4.2.1) 


For a more general question along the lines of Question 4.12, see [Par02]*. 

We illustrate the nature of this question with two examples, one for which the 
question can be seen to have a positive answer, the other for which the answer is 
unknown. 


Example 4.13. Elliptic curves over Q(t). 


Let E/Q(t) be an elliptic curve. Then the hyperelliptic degree 2 map E —> P! 
(given, for instance, by quotient by the hyperelliptic involution) is branched over 
2¢(E) + 2 = 4 points so the answer to Question 4.12 in this case is yes. 


Example 4.14. Genus two curves over Q(f). 


Let C/Q(t) be a genus 2 curve, which is necessarily hyperelliptic. In this case I 
do not know the answer to Question 4.12. We can consider again the hyperelliptic 
degree 2 maph: C — P!. This time |B(h)| = 2g(C) + 2 = 6. Taking the proof 
of Belyi’s Theorem as our guide, we may ask: 


Question 4.15. Does there exist g : P! — P! such that | B(g 0 /)| = 4? That is, is 
there a way of reducing the number of branch points from 6 to 4 in this case? 


4.3 Higher Dimensions 


The purpose of this section is to explain a question of V. Braungardt [Bra04] 
(see Question 4.17), which suggests a generalization of Belyi’s theorem to higher 
dimensional varieties.” The origin of Braungardt’s question is the observation that 
the space P! — {0, 1, oo} appearing in Belyi’s theorem is a moduli space of curves: 


8We thank the referee for bringing this reference to our attention. 

°A generalization of Belyi’s theorem for surfaces in a different direction than the one considered 
here can be found in [Par02]. Since we believe the generalization in [Par02] is less natural and of 
a more technical nature, we have chosen not to state it here. 
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It is the moduli space of genus 0 curves with 4 ordered marked points, since any 3 
points in P! can be mapped to {0, 1, oo} by a fractional linear transformation, unique 
up to permutation of {0, 1, oo}. 

Thus we open this section with a brief discussion of moduli spaces of curves 
with marked points. Then we state Braungardt’s Question. This is followed by an 
explanation of partial results in low dimension, some remarks on high dimension 
and a note on fundamental groups of hypersurface complements. 


Moduli spaces of curves. Let M,,, (resp. Mg jn)) denote the moduli space of 
genus g curves with n ordered marked points (resp.with a subset of cardinality 7). 
The moduli problem describing M,,, is not representable in the category of 
schemes when 77 is sufficiently small compared to g. Moreover, the moduli problem 
describing M g J] is never represented by a scheme. When the moduli problem is not 
representable, there does not exist a fine moduli space in the category of schemes. 
The principal obstruction to the representability of a moduli problem is (lack of) 
rigidity. This means that if the objects being classified by the moduli problem 
have non-trivial automorphisms, then the moduli problem is not representable in 
the category of schemes. For all values of g and nm other than the three pairs 
(g = l,n = 0), (g = 0,n = 1), (g = 0,n = 2), there does exist a scheme 
which is a coarse moduli space for genus g curves with n marked points. 

However, our interest in the moduli spaces of curves lies with their covers and 
fundamental groups. For this aspect of moduli spaces of curves it is essential to 
work with fine moduli, since passing to a coarse moduli space often annihilates 
much of the fundamental group. For this reason we find ourselves forced to view 
Men (resp. Mg nj) as a fine moduli algebraic stack and to consider 1;(Mg¢) as 
the stack fundamental group. 

The theory of stacks, moduli spaces and their representability in various cat- 
egories is very deep and sophisticated. Since we want to keep the exposition as 
self-contained as possible, while at the same time not veering off course to discuss 
unnecessary technical issues, we proceed as follows. We recall what it means for 
a map of algebraic stacks to be étale in terms of schemes, which suffices, as far 
as stacks are concerned, for understanding this paper. Then we illustrate some of 
the points made above with an example: The moduli space of elliptic curves Mj.1, 
which will also reappear in §5. For more on moduli problems we refer to the three 
classic works [KM1], [DR73] and [DM69]. 

Amap yg: X — J of algebraic stacks is étale if, for every pair (S, w) consisting 
of ascheme S anda mapa: S —> J, the pull-back S xy ¥ of X to S is a scheme: 


fag es 


| |e. (4.3.1) 


S 22% 
For example, one can check that the map M,, —> Mg nj, given by forgetting 


the ordering, is étale as a map of algebraic stacks. For this reason, we will for the 
most part restrict attention to Mg jn), rather than M,.,, from this point on. 
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Example 4.16. The Moduli space of Elliptic Curves. In terms of the above, the 
moduli space of elliptic curves is the moduli space M,,; of genus | curves with 
1 marked point (the 1 marked point being the origin, i.e., the identity for the 
group law). It is the quintessential example of a moduli space, in many ways the 
raison d’etre of moduli theory. Every elliptic curve has a non-trivial automorphism, 
namely the map called —1 which sends an element x to —x. The existence of non- 
trivial automorphisms of elliptic curves shows that the moduli problem for elliptic 
curves is not rigid and hence not representable: There does not exist a fine moduli 
scheme for elliptic curves. 


There are two ways to recover a fine moduli space. One way is to consider the 
moduli space of elliptic curves as a stack, as was mentioned in the general paragraph 
above. The other way is to impose level structure. This means that one modifies the 
moduli problem to that of classifying pairs (FE, L) where E is an elliptic curve and L 
is some additional structure arising from E, such as a basis of the N-torsion points 
of E for some positive integer NV. Adding level structure to the moduli problem is a 
way of “rigidifying” the moduli problem (eliminating non-trivial automorphisms), 
which is then often sufficient for proving that the modified moduli problem is 
representable and thus admits a fine moduli scheme. 

For example, if instead of classifying elliptic curves, we aim to classify pairs 
(E,L), with L a basis of the N-torsion of E, then for N > 3 there exists a fine 
moduli scheme for this moduli problem: It is the modular curve denoted Y(NV). 

At the same time, there does exist a coarse moduli scheme for elliptic curves, 
called the j-line. The j-line is isomorphic to the affine line A! by means of the map 
j which sends an isomorphism class of elliptic curves to its j-invariant. Over the 
complex numbers, the coarse moduli scheme of elliptic curves is also isomorphic to 
SL(2,Z) \ Hand the 7 line can be seen as giving a complex-analytic isomorphism 
of (open) Riemann surfaces 


{28LQ,Z)\ HSC. (4.3.2) 


Since the coarse moduli space of elliptic curves is isomorphic to Al, it is simply 
connected. By means of (4.3.2), we see that SL(2, Z) \ His also simply connected. 
The (topological) fundamental group of SL(2, Z) \ H is not SL(2, Z) because the 
action of SL(2, Z) on the upper half-plane H is not free. However if we consider 
SL(2,Z) \ H as a stack over C, or as an orbifold, then the stack or orbifold 
fundamental group of SL(2, Z) \ His SL(2, Z) (cf. [BN06]). 


Braungardt’s Question. The main question of this section is: 


Question 4.17 (Braungardt, [Bra04] ). Is every connected, quasi-projective vari- 
ety X that is defined over Q birational to a finite étale cover of some moduli space 
of curves M ¢ fn)? 


There are only a few limited partial results regarding Braungardt’s Question, 
which we now describe. 


Dimension 1. Belyi’s theorem gives a complete affirmative answer in the case 
that X is a curve. Indeed, every connected, quasi-projective curve is birational to 
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a connected, smooth projective curve and by Belyi’s Theorem every connected 
smooth projective curve defined over Q is birational to a finite unramified cover 
of P! — {0, 1,00} & Moa. 


Dimension 2. The moduli spaces M gj, of dimension two are Mos) and M19). 
There is an étale cover of algebraic stacks 


B 
M12) — Moy] (4.3.3) 


which we now describe. Let (£; {¢1, g2}) represent a point 7 € My, 2). Then, up 
to fractional linear transformation, there is a unique degree 2 map yg : E —> P! 
satisfying gz (q1) = GE (G2). We define B(n) € Mo,5] to be the point represented by 
(P!;{r,,...,rs}), where B(g) = {r1,...,r4} and rs = 0(q1) = ¢(q2). Therefore 
in dimension 2 Braungardt’s question is equivalent to: Is every connected, smooth 
projective surface birational to a finite étale cover of Mos? 

Here is a concrete description of Mo,5 as a subvariety of P! x P!. Let 


S = [(P! — {0, 1, 00}) x (P! — {0, 1, 00})] — A, (4.3.4) 


where A denotes the diagonal of the product. Then Mos & S.A point (a,b) € S 
corresponds to the 5-pointed projective line (P!; 0, 1, 00, a, b). 

An embedding Mo, <> P? is given explicitly in the last paragraph of this 
section. 


Summary of results of Braungardt on his question for surfaces. Braungardt has 
shown that his question has a positive answer for some special classes of surfaces. 


Theorem 4.18 (Braungardt, [Bra04]). 7. All abelian surfaces'® defined over Q 
are birational to finite etale covers of Mo,s}- 

2. All ruled surfaces, all fiber bundles of fiber genus = 2 and all elliptic fibrations 
defined over Q are birational to finite étale covers of Mos. 

3. If there is a positive answer to Question 4.12, then every smooth irreducible 
surface fibered over a curve with connected fibers is birational to a finite étale 
cover of Mos. 


From the classification of surfaces, we see that the surfaces not treated in 
Theorem 4.18 are the non-elliptic K3 surfaces and surfaces of general type. To the 
best of our knowledge, Braungardt’s question is open in these cases. 

We only give a proof of the assertion about abelian surfaces. However, note that 
the proofs of the other parts of Theorem 4.18 are similar in that they use the way the 


'0Tn this paper abelian surface will always mean algebraic abelian surface. 
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given surface is related to some curve to reduce to Belyi’s theorem for that curve. 
Indeed, observe that the surfaces covered by Braungardt are exactly those whose 
geometry is controlled by some underlying curve. 


Proof of Theorem 4.18 (1). The essential ingredient in the proof is a well-known 
dichotomy for abelian surfaces: Every principally polarized abelian surface is either 
a product of two elliptic curves or is the Jacobian of a genus 2 curve. In fact, every 
principally polarized abelian surface is the Jacobian of its theta divisor. Furthermore, 
every abelian surface over an algebraically closed field of characteristic 0 is 
isogenous to an abelian surface that admits a principal polarization. One can also see 
that the relevant dimensions happen to coincide. The dimension of A,, the moduli 
space of abelian varieties of dimension g is g(g + 1)/2. On the other hand, for 
g = 2, we have dim M, = 3g —3.SodimA, = dimM, if and only if g = 2 
or g = 3. The comparison of dimensions also shows why this proof for abelian 
surfaces cannot generalize to abelian varieties of dimension greater than 3. 

Let A be an abelian surface defined over Q. By composing with an isogeny 
(which is finite and everywhere unramified) we may assume, without loss of 
generality, that A admits a principal polarization. 


Case I: A is a product E, x E> of two elliptic curves. Since A is defined over Q 
also E, and E> are defined over Q. Applying Belyi’s Theorem to E; and E gives 
Belyi maps a : E; > P! and B : E, — P!. Putting together the maps a, B gives a 
map vy: A > P! x P' given by 


eas 1 I 
w:A—P xP. (4.3.5) 


Now, since a and f are Belyi maps, their branch loci are contained in {0, 1, co}. 
Hence y restricts to a finite unramified cover w : y~!(S) + S, where S is the 
image of the embedding of M5 into the quadric surface P! x P! defined in (4.3.4). 
Since the complement of $ in P! x P! is a proper closed subvariety (it is a union of 
lines) so is the complement of y—!(S) in A. Hence w~!(S) is an open subset of A, 
so w—!(S) is birational to A. 


Case II: A is the Jacobian of some genus 2 curve C. The Jacobian of C is birational 
to the symmetric square Sym?(C) of C. Since A is defined over Q, C is defined 
over Q too. Let g : C — P! be a Belyi map for C. By functoriality of symmetric 
powers, there is an induced map Sym?(y) : Sym*(C) — Sym?(P!), whose branch 
locus is contained in Sym*({0, 1, oo}). Hence the map Sym*(¢) is unramified over 
the quotient U of Mos by the transposition that interchanges the fourth and fifth 
marked points. Now U is a cover of Mo,s}, so A is birational to a finite unramified 
covering of Mo 5). oO 


Some remarks on dimension at least six. Given the content of Braungardt’s 
Question, it will be convenient to have the following notation. Given a variety V, 
let buc(V) denote the set of all varieties W such that W is birational to a finite, 
unramified cover of V. If V and W are two varieties satisfying both V € buc(W) 
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and W € buc(V), then we will say that V and W are BUC, written also VV ~guc W. 
On the other hand, if V and W satisfy both V ¢ buc(W) and W ¢ buc(V), then we 
will say that V and W are Ibuc,'! or that V is Ibuc with W. 

In terms of this notation, Braungardt’s Question may be rephrased as 


ie buc( Mj) = VARS, (4.3.6) 
{(g.n)| dim M g.n=k} 


where VAR& is the set of varieties of dimension k defined over Q. 


Conjecture 4.19. 1. The moduli space Mz of curves of genus 3, is Ibuc with the 
other moduli spaces M g jn] of curves of dimension 6, namely MoJo}, M16 and 
M2,3}. 

2. More generally, assume (g1, 71) and (g2, 22) are two distinct pairs of nonnegative 
integers satisfying 3g; —3 +n, = 3g2-—3+ no. Then Meg, jn,) and Mog, fn5] are 
Ibuc if and only if {g1, g2} Z {0, 1, 2}. 


The “Only if” part of Conjecture 4.19 is true. This can be seen by using the degree 
2 hyperelliptic maps to P! that genus 1 and genus 2 curves admit to construct maps 
between the moduli spaces, of the same dimension, of curves of genus 0,1 and 2 
with marked points. This was done above explicitly in the dimension 2 case. 

The moduli space of curves M, is of general type for sufficiently large g 
[HM82]. On the other hand, the moduli spaces Mo,, are rational, since Mo, 
embeds into P! x --- x P! (n — 3 copies) as a Zariski open set, via the natural 
generalization of (4.3.4), namely 


Mon o> Pl x... x Pl 


4.3.7 
(P!; pi,..., Pn) > (y(pa),-.-, Yn), oe 


where y € Aut(P') is the unique element satisfying y(p1) = 0, y(p2) = 1 and 
y(p3) = oo. This shows that, if g is sufficiently large, Mo, ¢ buc(M,). We are 
not aware of any other results regarding Conjecture 4.19. 

As J. Harris pointed out to us, there are several known instances in algebraic 
geometry where proving the non-existence of birational maps has proved to be a 
daunting task. For example, the general cubic hypersurface in P° is conjectured to be 
not rational, yet it has not been proved that a single such hypersurface is irrational. 

Suppose for a moment that the answer to Braungardt’s Question is “yes” and that 
Conjecture 4.19 holds true. This assumption has a surprising consequence. Namely, 
it would entail that to every variety V defined over Q is attached a discrete invariant, 
namely the non-empty finite set 


mc(V) = {(g,n) € Zso X Zso[V € buc(Mg fnj)}. (4.3.8) 


If this is indeed the case, then it would be interesting to characterize the invariant 
mc(V) independently of moduli spaces of curves. 


‘eT for incomparable. 


206 W. Goldring 


Fundamental groups of hypersurface complements. Let H be a hypersurface in 
a projective space P’””. It is a basic problem to understand the fundamental group 
z(P” — #) in terms of the geometry of H. When m = 1, A is a finite set of 
points, and it is classical that, as an abstract group 2(P!(C) — H) is the free group 
on |H|—1 generators. Although the general problem goes back to Zariski, when m 
is greater than one, it remains mysterious. As far as we know, the only general result 
about the fundamental group of the complement of a hypersurface in projective 
space is the following theorem of Fulton—Deligne in the case m = 2. 


Theorem 4.20 (Fulton—Deligne, [Ful80], [Del80]). Let C ¢ P?(C) be a nodal 
plane curve, i.e., either C is smooth, or the only singularities of C are nodes (also 
known as ordinary double points). Then the fundamental group 1 (P?(C)—C ) of the 
complement of C in P?(C) is abelian. More precisely, if the irreducible components 
of C have degrees d,,..., dg, then 


m(P?(C) — C) & Z/d\Z x +++ x Z/d,Z. (4.3.9) 


Fulton [Ful80] proved that the algebraic fundamental group 7/"*(P2(C) —C) (which 
is the profinite completion of the topological 2;(P?(C) — C)) is abelian. Then 
Deligne used Fulton’s results to deduce (4.3.9) and also gave an exposition of 
Fulton’s work in [Del80].!? 

The connection between fundamental groups of hypersurface complements and 
Braungardt’s question is that in certain cases Mz, or Mg Jn] can be realized as 
P” — H, for some hypersurface H. In fact, for Mo,, this can be done with m = 
n —3 and H a finite union of hyperplanes. For Mo,4, this is the above mentioned 
isomorphism Mo,4 = P! — {0, 1,00}. More generally, one has. 


Theorem 4.21. Supposen > 1. Let S = {pi,..., Pn+2} be a collection of n + 2 
points in general position in P". Let H be the union of the re hyperplanes 
spanned by subsets of S of cardinality n. Then 


Mont3 = P" — H. (4.3.10) 


In particular, setting n = 2 gives: 


Corollary 4.22. Mo,5 is isomorphic to the complement, in P?, of any 6 lines 
through 4 points, no 3 of which are collinear. 


In view of Corollary 4.22, we see that an affirmative answer to Braungardt’s 
question in dimension 2 would not contradict the Fulton—Deligne Theorem: The 
union of 6 lines through 4 points in general position is not a nodal curve; its singular 
locus consists of 4 ordinary triple points and 3 ordinary double points. 


PAs is explained in [Ful80] and [Del80], Zariski gave a proof of Theorem 4.20, but it was 
incomplete because it relied on a flawed argument of Severi. 
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Proof of Theorem 4.21. Let K be the hyperplane in P” spanned by pi,..., Pn 
and let L be the line spanned by p,+1 and p,+2. Since S is in general position 
K intersects L in a point; call this point z. Now let P € P” — H. We will 
associate to P a sequence g(P) = (¢1(P),...,@n(P)) of n distinct points lying on 
L —{Pn+1, Pn+2,z} in such a way that the map 


P"—H — Mon+3 


@: 
P = (L; Pn+1, Pn+2: 4; Q\(P),..-,@n(P)) 


(4.3.11) 


is an isomorphism. 

To define y;(P), consider the set S;(P) = (S — {p;}) U {P}. Since S is in 
general position and P ¢ H, the set S;(P) spans a hyperplane K;(P) in P”. Using 
again that P ¢ H, it follows that K;(P) intersects L in a point and that that point 
is distinct from py+1, Pn+2 and z. Set y;(P) to be the unique point of intersection 
of K;(P) and L. 

Now it is straightforward to check that g;(P) ¢ g;(P) for alli  j, so that ® 
is well defined, and also that © is a bijection. 

If g:(P) = 9j(P) andi # j, then g(P) € K;(P)M K;(P), so (S:(P) — pj) 
U o(P) = (S;(P) — pi) U g;(P). This yields an equality of hyperplanes 
K\(P) = span(S\(P) — pj) U g(P) = span(S;(P) — pi) Ug (P) = 
K;(P). But K;(P) = K;(P) implies K = K,;(P) = K;(P), so P € K, 
contradiction. 

To see that ® is injective, note that for all P € P”—#H, then hyperplanes K;(P), 
1 <i <n, are in general position, so they intersect in a single point, but P € K;(P) 
for alli, soM_,K;(P) = {P}. Thus g(P) = g(Q) implies {P} = N'_, Ki(P) = 
n'_,Ki(Q) = {Q}, so P = Q. Finally, if (y1,...,¥n) are n distinct points in 
L—{Pn+1, Pn+2, 2}, then the n hyperplanes span{S; U y;}, 1 <i <n, intersect in 
a point, call it P. By construction ®(P) = (L; pn+1, Pnt+2,Z, V1;--+> Yn). Hence ® 
is surjective. 

It is apparent from the construction of ® that both ® and ©! are morphisms, so 
that ® is in fact an isomorphism." Oo 


5 Modularity of elliptic curves and a question of Khare 


In this section we explain a question of Khare ([Kha04]). There are two results that, 
when viewed in conjunction, lead naturally to Khare’s question. The first result is 
an equivalent formulation of Belyi’s Theorem, while the second is the modularity 
of elliptic curves over Q (see Lemma 5.1 and Theorem 5.2 below). 


'3An alternative proof of Theorem 4.21 can be gotten by using (4.3.7), followed by a Segre 
embedding, to embed Mo, into a large projective space. Then one projects onto a suitably chosen 
hyperplane. The composite is the desired embedding. We leave the details to the reader. 
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5.1 Subgroups of SL(2,Z) 


To state these results we recall some terminology about subgroups of SL(2, Z). Let 
N = 1 be an integer. Consider 


T(N) = ker(SL(2, Z) > SL(2,Z/NZ)). (5.1.1) 


A subgroup T' of SL(2, Z) is called a congruence subgroup if T contains (NV) for 
some NV. An important class of congruence subgroups is given by the subgroups 
I'9(NV ), which are defined by 


b 


To(N) = iC a € SL(2,Z)|N divides c}. (5.1.2) 


A congruence subgroup is necessarily a finite index subgroup. The converse is not 
true.'* Given a finite index subgroup I’ of SL(2, Z), the congruence hull P’° of T is 
the intersection of all congruence subgroups containing I’. The quotient I’ \ H has a 
compactification obtained by adding finitely many points to  \ H. We will denote 
the compactification of T \ H by [ \ H. When [ = [o(N), we follow standard 
notation by writing [o(V) \ H = Yo(NV) and Po(V) \ H = Yo(NV) = Xo(N). 


5.2 Reformulation of Belyi’s theorem as uniformization 


The following lemma is an equivalent formulation of Belyi’s theorem in terms 
of quotients of the upper half-plane by finite index subgroups of SL(2, Z). It is 
easy to prove and has been observed, in various slighly different forms, by many 
people. Nevertheless, this reformulation of Belyi’s theorem plays an important 
conceptual role, and therefore should not be underestimated, as explained by Mazur 
in [Maz91]. 

Let 


a 


free (4) = 6 a €T(2) d =1 (mod 4); . (5.2.1) 


Then free (4) is a level 4 congruence subgroup, 1.e., Ufree(4) D T'(4). 


Lemma 5.1. Let X be a connected, smooth, projective curve defined over Q. Then 
there exists a finite index subgroup V of Vfyee(4) such that 


X2P\ (5.2.2) 


'4TIndeed, in a sense which can be made precise, “most” finite index subgroups of SL(2, Z) are not 
congruence subgroups. (See [LS03] for a discussion of this topic.) 
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Proof. The equivalence of Lemma 5.1 and Belyi’s theorem (Theorem 1.1) is a 
simple application of the Galois theory of covering spaces, coupled with the 
observation that Iye¢(4) acts freely on H and Tiree (4) \ H & P! — {0, 1, oo}. 
Let us indicate why the latter holds. One has —1 € (2), —1 ¢ free and [I'(2) : 
Tfree(4)] = 2. Hence 
T(2) = {+1} x Tfree (4). (5.2.3) 


The stabilizer of every point in H for the action of (2) is {+1}, so (5.2.3) implies 
that Tee acts freely on H. Now it is elementary and classical that (2) has three 
cusps, which can be mapped to {0, 1, oo}, so F(2) \H & P! — {0, 1, oo}. However, 
by (5.2.3), the action of ['(2) factors through that of Tyee(4). Therefore Pee (4) \ 
H = P! — {0, 1, 00}. In particular, this observation yields that 7; (P! — {0, 1, o0}) = 
Vireo. 

Next, we prove that Belyi’s theorem implies Lemma 5.1. Suppose X is a curve 
defined over Q. By Belyi’s theorem there exists a Belyi map g : X — P!. This 
means that the restriction gjy :U > P! — {0, 1, oo} is a finite unramified covering, 
where U = g7'!(P! — {0, 1, 00}). Since P! — {0, 1,00} & Thee(4) \ H and 2, (P! — 
{0, 1,00o}) & Vhee, the Galois theory of covering spaces implies that 7,(U) is a 
finite index subgroup of Tfree(4) and 2\(U) \H & U. By compactifying we see that 


X ~ 7 (U)\H. (5.2.4) 


Conversely, suppose that for some T C Tfree(4) we have (5.2.2). The inclusion 
T <> I'free(4) induces a covering P \ H > Pee(4) \ H, which extends uniquely to 
a Belyi map X¥ > P'. o 


It is interesting to compare Lemma 5.1 with the classical uniformization theorem 
that every curve of genus at least two is uniformized by the upper half-plane, in 
the sense that such a curve is a quotient of the upper half-plane by some discrete 
subgroup of SL(2,R). 


5.3 Modularity of Elliptic Curves over Q 


The following theorem is the Shimura—Taniyama—Weil Conjecture, proved by Wiles 
[Wil95], Taylor-Wiles [TW95] in the semistable!® case and completed in full 
generality by Breuil-Conrad—Diamond-—Taylor [BCDT01]. 


Theorem 5.2 (Modularity of elliptic curves over Q). Suppose E is an elliptic 
curve defined over Q. Then there exists an integer N (called the conductor of E) 
and a non-constant map 


X(N) — E. (5.3.1) 


In particular, this argument shows, in a roundabout way, that as an abstract group Dire (4) is 
isomorphic to the free group on two generators. 


‘©The definition of semistable was given in §2, in the discussion preceding Theorem 2.8. 
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5.4 Congruence Defects 


Following Khare, we define the congruence defect cd(T) of a finite index subgroup 
T of Pee (4) to be the index of I in its congruence hulli.e., cd(C) = [I°° : I]. The 
congruence defect measures how far I’ is from being a congruence subgroup. The 
congruence defect of I’ is 1 if and only if I’ is a congruence subgroup. Next, we 
define the congruence defect cd(£) of an elliptic curve E to be 


I C Vfpee(4) and F \ H admits 


(5.4.1) 
a non-constant map to E 


cd(E) = min  od(T) | 


The existence of the congruence defect of an elliptic curve defined over Q follows 
from the reformulation of Belyi’s Theorem (Lemma 5.1). 


Question 5.3 (Khare). Let K/Q be a number field. Does there exist a constant 
cd(K), the congruence defect of K, such that for every elliptic curve E'/ K, we have 
cd(E) < cd(K)? In other words, is the congruence defect bounded independently 
of the elliptic curve and solely in terms of the field of definition? 


By the modularity of elliptic curves over Q, the congruence defect of Q exists and 
is equal to 1. The existence of the congruence defect for any number field different 
from Q is not known. 


5.5 Modularity over general number fields 


Khare’s question 5.3 can be seen as a possible generalization of the modularity of 
elliptic curves over Q to arbitrary number fields. There is a more standard conjec- 
ture, coming from the Langlands correspondence, which offers a generalization of 
the modularity of elliptic curves over Q to modularity, or automorphy, of elliptic 
curves over general number fields. 


Conjecture 5.4. Let K/Q be a number field and Ax the adeles of K. Let E be 
an elliptic curve over K. Then there exists a automorphic representation zg of 
GL (2, Ax) which “corresponds” to E in the sense that: 


1. The L-functions of E and zg are equal: L(E,s) = L(zz,5). 

2. As £ ranges over primes of Q, the (duals of the) Tate modules 7;(£) form a 
compatible system of £-adic Galois representations {¢} attached to zz. 

3. For all but finitely many primes p of K, the characteristic polynomial of 
pe(Frob ») is (x — ap)(x — By), where ap, Bp are the Langlands parameters of 
the local component (7), of sg at p. 


For an introduction to the Langlands correspondence between Galois represen- 
tations and automorphic representations, see Taylor’s ICM talk [Tay02] and its 
expanded version [Tay04]. In particular, all of the terms used in the statement of the 
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conjecture above are explained in Joc. cit. For the third condition in the conjecture, 
see also [Tay94]. Conjecture 5.4 has been proved completely for K = Q and for 
totally real fields!’ satisfying certain technical conditions, using the techniques of 
modularity lifting theorems, as described in loc. cit. When the current machinery 
of modularity lifting theorems is perfected—there is reason to believe that this 
will happen in the next few years—it should allow one to prove, as a corollary, 
Conjecture 5.4 for all CM fields.!8 Beyond that, if K is not a CM field, then 
Conjecture 5.4 is wide open and appears to require genuinely new ideas. 

In the simplest case of K = Q, both the answer, cd(Q) = 1, to Khare’s 
Question and Conjecture 5.4 are equivalent to the modularity of elliptic curves 
over Q (Theorem 5.2). However, it is hard to say whether such a relationship 
between Khare’s question and Conjecture 5.4 persists over any other number field. 
For example, for number fields K where Conjecture 5.4 is known, it is not known 
whether this has any implication towards Khare’s question for those K. Let K be 
a number field different from Q. It would be extremely interesting if an affirmative 
answer to Khare’s question (Question 5.3) could be used to say something about 
Conjecture 5.4. 

One significant advantage of Khare’s question is that, like (5.3.1), itis geometric, 
in that it asserts the existence of maps between algebraic varieties. Conjecture 5.4, at 
least as stated above, lacks the same kind of geometric flavor. When K is totally real, 
there sometimes exists a geometric equivalent of Conjecture 5.4, in analogy with the 
K = Qcase. When this analogy works, the modular curves of (5.3.1) are replaced 
by Shimura curves, which can be thought of as arithmetic quotients of the upper 
half-plane by groups of units of quaternion algebras. It would be very interesting to 
find a connection, for some totally real fields, between parameterizations of elliptic 
curves by Shimura curves on the one hand, and the maps involving modular curves 
afforded by a positive answer to Khare’s question. 

However, even in the totally real case, one does not always have a parametrization 
by a Shimura curve. Moreover, when K is not totally real, there is no known idea 
of how to parameterize elliptic curves over K by Shimura varieties. This makes 
the possibility of Khare’s question providing such a parametrization over arbitrary 
number fields all the more exciting. 


6 Some personal remarks 


When I met Lang, I was 16 years old and he was 77. He was giving two talks in one 
day, on two totally different subjects, with only an hour break in-between, to give the 
audience a chance to catch its breath. Never again have I seen anyone captivate and 


"Recall that a number field K is totally real if the images of all its complex embeddings are 
contained in the real numbers. 

'8Recall that a CM field is a totally imaginary quadratic extension of a totally real field, the 
prototypical example of CM fields being imaginary quadratic fields. 
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inspire an audience, generating such electricity in the room, as when Lang lectured 
on the ABC Conjecture and the Heat Kernel. I walked out of the talks feeling, “I 
want to work on this stuff!”’. 

The next day Lang made sure that I became his student by showing up at my 
home in Los Angeles. He wasted no time and started doing mathematics with me 
on the spot. And so it went on day after day, with Lang continuously surprising 
me, showing up at my house, calling me early in the morning and late at night to 
do mathematics. That was Lang’s approach to mathematics: A unified process of 
learning, talking and doing research at the same time. 

Since Lang passed away, mathematicians often tell me things like “This problem 
is too hard,” and so on. But then I think of Lang. I immediately hear, “Stop fooling 
around and get back to work!” As I start working, I also hear Lang saying “It’s 
possible, let’s do it, let’s do it right now!”, and then I feel I’ve got to try, as Lang 
would have done if he were around. 
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On the local divisibility of Heegner points 
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Abstract We relate the local -divisibility of a Heegner point on an elliptic curve 
of conductor N, at a prime p which is inert in the imaginary quadratic field, to the 
first €-descent on a related abelian variety of level Np. 
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1 Introduction 


Heegner points on the modular curve Xo(NV), and their images on elliptic curve 
factors E of the Jacobian, enjoy many remarkable properties. These points are 
the moduli of level structures with endomorphisms by the ring of integers of an 
imaginary quadratic field K. Their traces to E(K) have height given by the first 
derivative at s = 1 of the L-function of E over K (cf. [GZ86]), and their ¢- 
divisibility in the Mordell—Weil group controls the first £-descent on E over K (cf. 
[Gro]). 

In this paper, we show how their ¢-divisibility in the local group E(K,), where 
p is a prime that is inert in K, often determines a first descent over K on a related 


B.H. Gross ()) 
Department of Mathematics, Harvard University, Cambridge, MA 02138 
e-mail: gross @math.harvard.edu 


J.A. Parson 
Hood College, Department of Mathematics, 401 Rosemont Avenue, Frederick, Maryland 21701 
e-mail: parson @hood.edu 


D. Goldfeld et al. (eds.), Number Theory, Analysis and Geometry: 215 
In Memory of Serge Lang, DOI 10.1007/978-1-4614-1260-1_11, 
© Springer Science+Business Media, LLC 2012 


216 B.H. Gross and J.A. Parson 


abelian variety A over Q. The abelian variety A is associated to a modular form 
of weight 2 and level Np that is obtained by Ribet’s level-raising theorem from 
the modular form of level N associated to E. This descent result is Theorem 2 
below. To prove the descent theorem, we compare the local conditions defining 
a certain Selmer group for A with those defining the €-Selmer group for E. The 
conditions agree at places of K prime to p, and at p the condition changes from 
the unramified local condition to a transverse condition. The parity lemma proved 
in §5.3 then compares the ranks of the corresponding Selmer groups in terms of the 
{-divisibility of P in E(K,) and allows us to understand a first descent on A/K 
based on Kolyvagin’s determination of the first 2-descent on E/K. 

Some related work on the Selmer group can be found in [BD99, Prop 1.5] and 
[BDO5]; a comparison with the value of the L-function at s = 1 is given in [BD99, 
Thm 1.3]. 


2 The main theorem 
2.1 Heegner points and Kolyvagin’s descent 


Let E be an elliptic curve over Q of conductor N. It is now known that E is 
modular (cf. [BCDTO1]): the L-function L(s,E) = 7,5, ann~* of E over Q is 
the Mellin transform of a modular form f(t) = S°,5, ane?7'"" of weight 2 on 
To(N). Together with the isogeny theorem of [Fal83], this implies that there is a 
dominant morphism 

aw: X(N) OE 


over Q, where Xo(V) is the modular curve classifying elliptic curves with a cyclic 
N-isogeny (cf. [BSD75]). We will assume z has minimal degree, and that it maps 
the cusp oo to the origin of E. Then z is determined up to sign. 

Let K be an imaginary quadratic field where all primes dividing N are split, and 
choose a factorization (V) = n-n with ged(n, n) = 1 in the ideals of the ring of 
integers Ox of K. The isogeny of complex elliptic curves C/Ox —> C/n7! defines 
a point x € Xo(N)(C). By the theory of complex multiplication, x is defined over 
the Hilbert class field H of K. We define 


P = Try/x ™(x) in E(K). 


Then P is defined up to sign by F and K. For these facts and a general introduction 
to Heegner points, see [Gro84]. 

We will assume in the rest of this paper that P has infinite order in E(K). By the 
main result in [GZ86], this condition holds precisely when L’(1, E/K) 4 0. Since 
E(K) is finitely generated, P can be divisible only by a finite number of primes £ 
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in E(K). Kolyvagin showed that the group E(K) has rank 1, and he completed the 
first €-descent at the primes that do not divide P. More precisely, assume that 


(a) £ is an odd prime that does not divide P in E(K), 
(b) the Galois representation 


p : Gal(Q/Q) > Aut(E[£](Q)) 


is surjective. 


Recall that the £-Selmer group Sel(E/ K, €) is the subgroup of classes in H!(K, E[€]) 
consisting of classes c whose local restrictions c, € H!(K,, E[é]) lie in the 
images of the local Kummer maps E(K,) > H!(K,, E[€]). Kolyvagin showed the 
following: 


Theorem 1. Assume that P has infinite order in E(K) and that conditions (a) and 
(b) hold. Then the €-Selmer group Sel(E/K, £) has dimension | over Z/£Z, and it 
is generated by the image of P € E(K) under the global Kummer map E(K) —> 
H'(K, E[é)). 


A proof is given in [Gro]. 


2.2 Level-raising and local divisibility of P 


Now let p 4 € be a prime that is inert in K. Then p does not divide N. Under the 
conditions of Theorem 1, we will consider the local divisibility of P by £ in E(K,). 
The following observation motivates the analysis below: 


Lemma 1. [f the Heegner point P is not divisible by € in E(K,), then 
(c) ap =+(p +1) (mod £). 


More precisely, the sign in c) can be taken to be —e, where € is the sign of the 
functional equation of L(s, f). 


Proof. Since P is not divisible by ¢ in E(K,), it has non-zero image in E(K,)/ 
CE(K,). As p does not divide N £, the latter group is isomorphic to €(F,2)/LE(F ,2), 
where O, is the ring of integers of K, and €/O, is the Néron model. Since €(F,2) 
is a finite group, the elliptic curve €/F,,2 has a rational €-torsion point. Therefore, 


Frob*, acts on E[¢](Q) with eigenvalues (1, p”). Since the determinant of Frob, on 
E[€](Q) is p, its eigenvalues on E[é](Q) are (+1, +p). As the trace of Frob, on 
E[£](Q) is equal to ay modulo £, this completes the proof of (c). 

The more precise statement about the sign follows from the formula 


P =-eP +f, 
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where tf € E(Q) is a torsion point (cf.[Gro84]): by assumption (b), the order of f is 
prime to £, and so the image of P in E(K,)/€E(K,) satisfies Frod,(P) = —eP. 
Consequently, €/F,,2 has a rational £-torsion point in the —e eigenspace for Frobp, 
and the eigenvalues of Frob, are —€ and —ep. oO 


From now on, we assume that condition (c) holds, which is automatic when P is 
not divisible by ¢ in E(K,,) by the lemma. Conditions (b) and (c) are the hypotheses 
of the level-raising Theorem 1 of [R]. This theorem produces a normalized newform 
g of level dividing Np that is p-new and that has trivial Nebentypus character. The 
theorem also constructs a place A of Q over @, and g and A have the property that 
for all rational primes g # p, one has 


ag(f) = aq(g) (mod A). (2.1) 


2.3. The Eichler—Shimura construction 


Let h € S2(T(M),C) be a normalized newform, and let F = Q(h) be the 
subfield of C generated by its Hecke eigenvalues. The Eichler-Shimura construction 
associates to h a pair (A,i), where A is an abelian variety up to isogeny over Q of 
dimension [F : Q], and where i : F —> End°(A) is an isogeny action of F on A 
that is defined over Q. 

Recall the construction: corresponding to the newform h, one has an algebra 
homomorphism Q ®z T — F, where T is the Hecke algebra at level M. The object 
(A,7) is Homger(F, Jo(M)), suitably interpreted, where Jo(M) is considered as 
an abelian variety over Q up to isogeny with action of Q ® T as endomorphisms. 
Since Jo(M) has good reduction at finite places of Q not dividing M, so does 
A. Let be a finite place of F over the place w of Q. The w-adic Tate module 
V..(A,i) = Fy ®Q,, Vv(A) is a 2-dimensional vector space over F,,, which admits 
an action of Gal(Q/ Q) that is unramified away from M and w. The Eichler-Shimura 
relation implies that for any finite place v of Q prime to M and w, the characteristic 
polynomial of (arithmetic) Frobenius at v acting on V,,(A,i) is T? — a,(h)T + q, 
where the Hecke eigenvalue a,(/) is considered as an element of Q(h) = F C Fy. 


2.4 The theorem 


Let (A,i) be the object associated by the Eichler-Shimura construction to the 
newform g provided by Ribet’s level-raising theorem. There is an abelian variety 
over Q in the isogeny class A such that the maximal order R C F = Q(g) acts 
on A compatibly with i (cf.[Shi98], §7.1, Proposition 7). We fix one such abelian 
variety in the isogeny class, and we write (A,i) for it as well. Many constructions 
below depend on the action i : R — End(A), but since it is fixed throughout, we 
will generally omit it from the notation. 
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Let J be the maximal ideal of R induced by the finite place A of Q provided by 
Ribet’s theorem. The main result below compares the first J-descent on A/K with 
the first £-descent on E'/ K. The /-descent structures on A are very similar to the ¢- 
descent structures on an elliptic curve. Since J need not be principal, however, one 
must often tensor with powers of the R-modules J and J! to define maps whose ¢- 
descent analogues would involve multiplying by powers of the generator £ of (€) C 
Z,. For example, for an R-module M, the R-linear analogue of the multiplication- 
by-€ endomorphism is the homomorphism J ®r M — M: if I is principal, 
then the choice of a generator x, which can also be viewed as an isomorphism 
x : R — I, converts 1 @®r M — M into the endomorphism of multiplication by 
xonM. 

To complete the first /-descent, we define an /-Selmer group of A/K as 
follows: let I~! @p A be the K-scheme that represents the functor T > [ 1 @p 
A(T) = Homar(/, A(T)). We show that this functor is representable, and we give 
more details on the abstract formalism of /-descent in the appendix below. Then 
I~! @p Ais an abelian variety of the same dimension as A/K, on which R acts as 
endomorphisms. For example, if A(C) = C!**@l/A, then the lattice A has a natural 
R-module structure, and I~! @p A(C) = Cl'2/I—! @p A. (The embedding of 
I~! @p A asa lattice in Cl is the unique extension of the embedding of its 
finite-index subgroup A.) 

The inclusion R — J~! induces an isogeny A > I~! @ p A defined over K with 
kernel A[/], the group scheme of /-torsion sections of A. We thus have the exact 
sequence 


Os A) ase wg Ss Pe oS 


of group schemes over Spec(K). Passing to Galois cohomology, we find a global 
Kummer (boundary) map 


I @r A(K) > H{(K, A[/]). 


In the familiar situation of €-descent, the boundary map E(K) — H!(K, E[é]) 
maps a point x € E(K) to the E[£]-torsor composed of the points {¢~!x} C E(K). 
In the 7-descent formalism, we would replace x € E(K) with (€~'!) @x € (2)! @ 
E(K). The image of (€~') @ x under the Kummer map is the fiber over this point 
of the isogeny E > (£)7! @ E. 

For each place v of K, there are analogous local Kummer maps 


I~! @r A(K,) > H'(K,, AL). 


The [-Selmer group of (A,i), denoted Sel(A/K, /), is the group of classes c € 
H!(K, A[/]) such that the restriction c, € H!(K,, A[/]) is in the image of J~! @p 
A(K,) for all places v of K. Evidently the global Kummer map factors through 
Sel(A/K,1) C H!(K, A[/]). 
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Just as for standard ¢-descent, one can show that Sel(A/K, /) is a finite group 
and hence a finite-dimensional R /J-vector space. The kernel of the global Kummer 
map I~! @p A(K) > Sel(A/K, /) is the image of the natural map 


R @p A(K) > 17! @p A(K), 
and so the image of the Kummer map is J~!/R @p A(K) or, equivalently, 
I @r (R/T @pr A(K)). 


The R/J-dimension of Sel(A/K, /) is thus an upper bound on the rank of A(K) 
as R-module, just as the Z/£Z-dimension of Sel(E/K, £) is an upper bound on the 
rank of E(K) as Z-module. 

Let us impose the following additional assumptions on p, N, and E: 


(d) £ is prime to N, 
(e) For each prime qg dividing N, the dimension of the g-inertia invariants of the 
modulo-£ Galois representation 


p : Gal(Q/Q) > Aut(E[¢](Q)) 


is 1, if E has multiplicative reduction at q, and is 0, if E has additive reduction 
at q, 
(f) p? £1 (mod £). 


Since £ is prime to N, condition (e) is equivalent to the statement that the 
conductor of p in the sense of [Ser87] is equal to the conductor N of E/Q. By level- 
lowering theorems (cf. [Dia95]), condition (e) means that f is a form of minimal 
level among those giving rise to the modulo-¢ representation p. It also implies that 
the form g constructed by level raising has level Np. 

We will deduce the following result about the /-descent on A/K from Kolyva- 
gin’s result stated in Theorem |: 


Theorem 2. Assume that P € E(K) has infinite order and that conditions (a)-(f) 
hold. Then 


dimr/; Sel(A/K, 1) = 0, if P is not divisible by € in E(Kp), and 
dimr/; Sel(A/K, I) = 2, if P is divisible by £ in E(K,). 


The J -adic Tate module 
an PB Qn N1V(ay 
T/(A,i) = lim" @r A[I"](Q)) 


is a lattice in V,(A,i) and is thus free of rank 2 over the /-adic completion 
of R. Therefore, R/I ®r T;(A,i) = I @pr A[l]|(Q) and hence A[/](Q) is 
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2-dimensional over R/J. By the Eichler-Shimura relation and (2.1), the charac- 
teristic polynomials of prime-to-Np£ Frobenius elements on R/I @ E[£](Q) and 
A[I](Q) agree. Thus, by the Brauer—Nesbitt principle, these Galois modules have 
isomorphic semi-simplifications. By assumption (b), the Galois module R/J ® 
E[€|(Q) is irreducible, and so the two (R/J)[Gal(Q/Q)]-modules are isomorphic. 
We choose and fix such an isomorphism for the rest of the paper. 

The plan for proving Theorem 2 is to compare the local conditions defining 
Sel(A/K, 1) C H!(K, A[/]) and 


R/I ® Sel(E/K,) C R/I @H'(K, E[f]) =H'(K, R/I ® Efé]), 


by means of the fixed isomorphism R/J ® E[¢]/Q = A[/]/Q. Using assumptions 
(a)—-(f), we describe these local conditions entirely in terms of the Galois module 


A[I](Q) = R/T ® E[Q). 


The local conditions agree at all places v of K except at v = p, where the two 
conditions are transverse. By combining this local analysis with a global parity 
lemma, proved in §5.3, we deduce Theorem 2 from Theorem 1. In §4 we study 
the local conditions defining the Selmer groups at places v # p of K, and in §5 
we study the conditions at v = p and use the parity lemma to prove the theorem. 
In §6 below, we make explicit the compatibility with the parity predictions of the 
conjecture of Birch and Swinnerton-Dyer. 


3 Néron models of abelian varieties with real multiplication 


This section contains preliminary remarks that will be applied in Sections 4 and 5 
to Néron models of E and A in order to describe the images of the local 
Kummer maps entirely in terms of the Galois modules underlying E[€]/Q and 


A[T]/Q. 


3.1 Semi-stable case 


Let L be the quotient field of a Henselian discrete valuation ring Oz with residue 
field k,, and let B/L be an abelian variety. Let Fp be a number field with maximal 
order Ro. Assume that the dimension of B/L is [Fo : QJ]. Let j : Ro — 
End(B/L) be a faithful action of Ro. In what follows, we will use Fy = Q and 
fo = F. 

Let B/O_ be the Néron model of B/L. We consider first the case when B/L 
has semi-stable reduction. This condition means that the identity component B°/k 
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of the special fiber of the Néron model is an extension of an abelian variety by a 
torus T/k,. Let kr bea separable closure of k,. The torus T/k, is determined 
by its geometric character lattice X*(T/k,), which is a free Z-module of rank 
dim(7), with the induced action of Gal(k,/k ,) on this lattice. The semi-stable 
abelian variety B/L has good reduction, if the torus component of the special fiber 
is trivial. In this case, the special fiber is connected and is an abelian variety, and so 
B/Oy, is an abelian scheme. If the abelian-variety component of B°/k,, is trivial, 
then B/L has purely toric reduction. 


Lemma 2. /f B/L has semi-stable reduction, then B/L has either good reduction 
or purely toric reduction. In the purely toric-reduction case, the functorial action of 
Ro on X*(T/kz) makes this lattice an invertible Ro-module. 


Proof. By functoriality of the Néron model, the action 7 induces a (unital) ring 
homomorphism Ry — End(T/k,) = End(X*(T/k,)). Tensoring with Q, we find 
an Fy-vector-space structure on Q @ X*(T/k,). Therefore, the dimension of T/k, 
is a multiple of [Fy : Q] = dim(B). Since dim(T) < dim(B/L) = dim(B°/k_), 
we see that either T = 0 or T = B°/k_. In the first case, B/L has good reduction. 
In the second case, B/L has purely toric reduction, and X*(T/k,) is an Ro-lattice 
in the 1-dimensional Fy-vector space Q @ X*(T/k ). Thus the character lattice is 
an invertible Ro-module, since Ro is the maximal order of Fo. oO 


Consider the purely-toric reduction case of the lemma. Since T is split over k, 
we have the natural isomorphism 


T/k, = Hom(X*(T/kz), Gm), 


which is a functorial expression of the fact that T/k is isomorphic to a product of 
copies of the multiplicative group G,,, indexed by any basis of X*(7'/kz). Thus for 
any ideal Jp of Ro, we have 


T [Io]/kK_ = Hom(X*(T/k1), Gm) [Io] = Hom(Ro/ Io @ xy X*(T/Kx), Gn). 
Therefore, we have 
T [Iol(Ki) = Homz(Ro/ Io ®z, X*(T/K1). Kz. ): 


As an abstract group, ki is isomorphic to [| | pein Q, /Zq, where q runs over primes 
not equal to the characteristic go of k,. Consequently, if Jo is a maximal ideal such 
that the order of Ro/Jo is invertible in k,, then T[Io|(Kz) is 1-dimensional over 
Ro/ Io. We state this fact as a lemma for later reference. 


Lemma 3. Assume that B/L has purely toric reduction. Let Iy be a maximal 
ideal of Ro such that the order of Ro/Io is invertible in ky. Then B°[Io|(k i) is 
1-dimensional over Ro/ Ip. 
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3.2. Component groups in the general case 


We now consider abelian varieties B/L with endomorphisms j : Ro > End(B) as 
above, but we allow arbitrary (not necessarily semi-stable) reduction. Let B/Oz be 
the Néron model of B/L, and let B°/k,, be the connected component of the identity 
of B/k,. In the general case B°/k,, is a successive extension of an abelian variety, 
a torus, and a connected unipotent group. In this section, we are concerned with the 
component group scheme ® = B/B?° of the special fiber. It is a finite, étale Ro- 
module scheme over k;,. Let B°/O,, be the smooth group scheme whose generic 
fiber is B/L and whose special fiber is B°/k , i.e., B°/Oz is the complement in 
B/O_, of the non-identity components of 6/k_. 

Let Ig C Ro be a maximal ideal such that the order of Ro/Jo is invertible in 
O, and thus in L and k;. Let us see how the group ®[Jo](k,) appears in the 
Galois module B[Io]/L. Consider the Ro-module extension of ®[Jo](k,) by B°(kz) 
obtained by restricting 


0 > B°(k,.) > B(kL) > ®(k,) > 0 


to ®[Io](K1). We claim that this extension of Ro-modules splits. Since the group 
®[J|(K_) is a direct sum of copies of Ro/ Jp as Ro-module, it suffices to show that 


Ext, (Ro/ Io, B°(k1)) = 0. (3.1) 


To see this vanishing, we consider the homomorphism B — I) '@p,B of smooth 
group schemes over O_, as in the formalism of Jo-descent discussed in the appendix. 
On the relative Lie algebras, this morphism is the natural homomorphism of free 
O,-modules 


Lie(B/O1) > LieU> | @rx, B/OL) = 15 | @r, Lie(B/Ox). 


The kernel and cokernel are annihilated by Jp; since Jp contains the order of Ro/ Jo, 
which is a unit in O;, the map is an isomorphism. Consequently, the homomorphism 
of smooth group schemes B > I,! ®p, B over Oz is étale. Since the geometric 
fibers of B° and I>! @p, B° over O, are connected, the homomorphism B° > 
I5| ® rp B° is thus étale and surjective. In particular, B°(k,) > I; ! @r, B°(kz) is 
surjective. 

Consider the projective resolution 


(—) pes Ry Ri 


of the Ro-module Ro/ Jo. From the long exact Extr,-sequence with coefficients 
B°(k,), we find that Ext, (Ro/ Jo, B°(k_)) is the cokernel of 


Homa, (Ro, B°(k,)) ——> Homa, (Io, B°(Kz)). 
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Since the map 
B°(k,) = Homa, (Ro, B°(ki)) > Home, (Io, B°(ki)) = Ih! ®x B(k1) 


is surjective, we obtain the desired vanishing (3.1). 
If we choose a splitting, then we find an Ro/Jo-module isomorphism 


BUlol(ki) = BUol(Ki) ® ®[Lo] (Kz). (3.2) 


Let L be a maximal unramified extension of L with residue field kx, and let Or be 
the valuation ring of L. By the Néron property, we have B[Jo](L) = B[Io](O7). 
Since B[Jo]/O7x is étale (as the kernel of an étale homomorphism B > I! @z, B 
over O,) and O, is Henselian, the reduction map B[Jo](O7) > B [Tol(Kz) is an 
isomorphism. Therefore B [Jo](L) = BlIol(k,). The Ro/Jo-module BUIo|(L) is 
the space of inertia invariants in B[Jo](L), where L is a separable closure of L 
containing L.In summary, we have 


Lemma 4. /f the order of Ro/Io is invertible in O,, then as Ro/Io-module, the 
space of inertia invariants in B{Io|(L) is B°[Io\(ki) ® ®[Io] (Kz). 


3.3. Reduction of E and A at finite places of K 


We now describe some aspects of the reduction of E and A at primes q of Q and at 
finite places v of K using the results of this section and hypotheses (a)—(f). Since NV 
is the conductor of E'/Q, if g and v are prime to N, then £/Q has good reduction at 
q, and E/ K has good reduction at v. Let v be a place of K dividing N and let q be 
the prime of Q lying under v. The elliptic curve E'/Q then has either multiplicative 
or additive reduction at g. Since, by assumption, g is unramified and splits in K, 
we have Q, = K,, and the elliptic curve E/K has the same reduction at v as 
E/Qat q. If E/Q has additive reduction, then by condition (e), there are no non- 
zero inertia invariants in E[€](Q,). Thus by Lemma 4, since ¢ is prime to N, the 
component group of the reduction at q has no €-torsion. If E'/Q has multiplicative 
reduction at q, then the inertia invariants in E[€](Q,) are 1-dimensional over Z/€Z, 
by assumption (e). By Lemma 3 and 4, all of these inertia invariants come from the 
torus part of the reduction, and the component group of the reduction at g again has 
no non-zero €-torsion. 

If g and v are prime to Np, then the abelian variety A/Q has good reduction at 
q, and A/K has good reduction at v, since A/Q is an isogeny factor of Jo(Np)/Q, 
which has good reduction away from Np. It follows from the construction of the 
semi-stable model of Jo(Np)/Q, over Zp (cf. [DR73]) that the new quotient of 
Jo(Np)/Q has purely toric reduction at p. Since the eigenform g used to construct 
A/Q is new at p, the abelian variety A/Q thus has purely toric reduction at p. 
Therefore, A/K has purely toric reduction at places v of K dividing p. 
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By condition (e), the Galois module E[¢](Q) is ramified at primes q dividing NV. 
Since R/I ® E[€\(Q) = A[I](Q as Galois modules, the Galois module A[/](Q) 
is also ramified at primes g dividing N. Therefore, A/Q, has bad reduction at all 
primes q dividing NV. As above, if v is a place of K dividing N and lying over the 
rational prime q, then qg is unramified and split in K. Therefore K, = Q,, and so 
the reduction of A/K at v is the same as the reduction of A/Q at q. If qg divides 
N but q? does not divide N, then A/Q has semi-stable reduction at g (and hence 
v), since it is an isogeny factor of Jo(Np)/Q, which has semi-stable reduction at 
q. Since A/Q has bad reduction at g, it has purely toric reduction by Lemma 2. 
The inertia invariants in A[J](Q,) are 1-dimensional over R/J by assumption (e) 
and the isomorphism R/I @ E[¢](Q) = A[/](Q). Therefore, by Lemmas 3 and 4, 
the component group of the reduction of A/Q at g has no /-torsion. On the other 
hand, if g? divides N, then E/Q has additive reduction at q, the inertia invariants 
in A[/](Q,) are 0-dimensional over R/J by assumption (e) and the isomorphism 
R/I ® E[€\(Q) = A[I](Q). Therefore, by Lemma 4, the component group of the 
reduction of A/Q at g has no J -torsion. 


4 Proof of Theorem 2: local conditions at v # p 


To prove Theorem 2, we will compare 
R/T ® Sel(E/K,£) C R/I @H'(K, E[€]) = H'(K, R/T ® E[€)) 


with Sel(A/K,7) Cc H!(K, A[/]). Here we identify H'(K, R/J @ E[é]) with 
H!(K, A[/]) using the fixed isomorphism R/J @ E[¢€] = A[J] over Q from 
§2.4. We compare these two Selmer groups by comparing the local conditions in 
H!(K,, A[J]) that define them. Let L, Cc H!(K,, A[/]) be the R/J-span of the 
image of 

E(K,) > H!(K,, E[€)) Cc H'(K,, A[Z]). 


Then R/I ® Sel(E/K, £) consists of the classes c € H!(K, A[/]) such that each 
restriction c, € H!(K,, A[/]) belongs to Ly. Let L’ Cc H!(K,, A[/]) be the image 
of the local Kummer map J~' @p A(K,) > H!(K,, A[J]), so that Sel(A/K, J) is 
the set of classes c € H'(K, A[J]) such that c, € L’, for all places v of K. 

The remainder of the present section is devoted to proving 


Lemma 5. For v # p, we have L, = L'. 


We prove the lemma by deducing from assumptions (a)-(f) descriptions of L, 
and L’ entirely in terms of the Galois modules E[€]/Q and A[J]/Q. The fixed 
isomorphism R/I ® E[€] = Al[Z] over Q then allows us to identify the local 
conditions as in the lemma. In §5 below, we complete the local comparison by 
analyzing L, and L’,, which is the only place at which the defining local conditions 
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for the two Selmer groups differ. With the local comparison in hand, we use a parity 
lemma based on local and global duality theory in Galois cohomology to deduce 
Theorem 2 from Theorem 1. 


4.1 Types of local conditions 


Let Oy, be a discrete valuation ring with quotient field L and residue field k,. Let 
V be a locally constant constructible sheaf of abelian groups on the small étale site 
of Spec(L). The data of V is carried equivalently by the finite-order Z[Gal(L/L)]- 
module V(L), where L/L is a separable closure. 

The subspace of H!(L,V) composed of classes that split over an unramified 
extension of L is denoted H1,,(L, V). Alternatively, if j : Spec(L) > Spec(Oz) 
denotes the inclusion, the unramified classes are 


H;,(Spec(Oz), jxV) C Hg,(Spec(L), V) = H'(L,V). 


From either description, it is clear that formation of unramified classes is functorial 
inV. 

Suppose that there is a finite, free group scheme G/ Spec(O_) whose restriction 
to Spec(L) represents V. One then has the subspace of flat classes valued in G/Oz 


H,(Spec(Oz), G) C Hj (Spec(L), G) = H'(L, V). 


The formation of such a subspace is functorial in the flat model G over Spec(O_). 


4.2 Identifying local conditions for Ip-descent on abelian 
varieties 


We return briefly to the general notation of §3: O, is a Henselian discrete valuation 
ring with quotient field L and residue field kz; one has an abelian variety B/L, 
equipped with an action of the ring of integers Ro of a number field Fo such that [Fo : 
Q] = dim(B/L). Let B/O_ be the Néron model, and let ®/k, be the component 
group scheme of 6/k,. Let Ig C Ro be a maximal ideal such that the order of Ro/ Io 
is invertible in L. Then the homomorphism B > J)! @x, B is an étale isogeny 
with kernel B[Jo], and we have the Kummer map J)! ®r, B(L) > H'(L, B[Io)). 
The following two lemmas identify the image of the Kummer map under certain 
restrictions on the reduction of B/L. The cases with Ro = Z are entirely standard 
facts (cf. [Maz72], for instance). 


Lemma 6. [f the order of Ro/ Io is invertible in Ox, and ®[Io| = 0, then the image 
of the Kummer map in H'(L, B[Io]) is H),.(L, Bo). 


unr 
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Proof. Since the order of Ro/Jo is invertible in Oz, as in §3.2, we find that the 
homomorphism B > I, | @p, B is étale and that B° > I>' ®p, B’ is surjective and 
étale. Since ®[Jp] = 0 and ®/k;, is finite étale, the natural map ® > I>! @z, ® 
is an isomorphism, and so B > Ip ' @ B is surjective and étale. One thus has 
the Kummer maps for B/L and B/Q_, as discussed in the appendix, which are 
connected by restriction maps in étale cohomology: 


Ty! ®r, BOL) ——~ Hj(Spec(Oz), Blo]) 


| | (4.1) 


Ix! ®p, B(L) ——> H}(Spec(L), BU). 


By the Néron mapping property we have B(O,) = B(L), and so the left vertical 
arrow in an isomorphism. Furthermore, the top arrow is surjective, which one sees 
as follows: 

The cokernel of the top arrow is the kernel of 


Hi,(Spec(O),B) ——~+ H4,(Spec(Oz), Ip! ®r B), 


and so to see surjectivity, it suffices to check that Hi,(Spec(O;,), B°) = 0 and that 
® — [,' ®p, ® is an isomorphism. The second fact follows from ®[Jo] = 0, as 
noted above. To see the first fact, recall that a class in H'(Spec(Oz;), B°) can be 
represented by a right torsor under the étale sheaf represented by B°. Since the base 
O, isa discrete valuation ring and B° is smooth over QO, this torsor is representable 
by a scheme over Spec(O,) by a theorem of Raynaud (cf. [Mil80], Chapter III, 
Theorem 4.3). It suffices therefore to show that any scheme P over Spec(O_z) 
that is a right B°-torsor has a section. Since B°/k, is connected, Lang’s theorem 
(cf. [Lan56], Theorem 2) implies that P(k,) is non-empty. As B°/ Spec(O,,) and 
hence P/ Spec(Q ,) are smooth and Oy, is Henselian, the map P(O,) > P(k_z) is 
surjective. Consequently, the element of P(kz) provided by Lang’s theorem lifts to 
a section over O_. 

Since the left vertical arrow in (4.1) is an isomorphism and the top arrow is a 
surjection, the image of the Kummer map in H'(L, B[Jo]) is Hi,(Spec(Oz), B[Jo)). 
The Néron mapping property identifies j,,.B = 6 as sheaves on the small étale site 
of Spec(O;,). Thus 


Blo] = (+B) [To] = jx(B[o]). 
and H},(Spec(Q;), B[Jo]) = H!,,(L, BUo]). Oo 


Alternatively, assume that the order of Ro/ Jo is invertible only in L, but consider 
only B/L with good reduction. The Néron model B/Oz is then an abelian scheme. 


Lemma 7. [f B/L has good reduction, then B[Io|/Ox is finite and locally free. 
Furthermore, the image of the Kummer map I! ®p, B(L) > H'(L, B[Io]) is 
Hj (Spec(Oz), B[Zo)). 

Proof. As explained in the appendix, since B/Oz is an abelian scheme, the 
homomorphism B —> J, ! @p, B is flat and surjective. Furthermore, the kernel 
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BlIo|/Ox is finite and locally free. To complete the proof othe lemma, one can 
apply the same argument as in Lemma 6, replacing étale cohomology with flat 
cohomology. oO 


4.3 Proof of Lemma 5 


For v the infinite place of K, we have K, = C, and so H'(K,, A[/]) = 0 and 
L, = Li =0. 

Next consider a finite place v of K prime to pf. Let E/O, and A/O, be the Néron 
models of E'/K, and A/K,, respectively. Let ®g and ®, be the component group 
schemes of the special fibers of €/O, and A/O,, respectively. By the discussion in 
§3.3, we have ®g[€] = 0 and ®,4[/] = 0. Therefore, Lemma 6 implies that 

Ly=R/1 @H(K,. EIQ) and LO=H 


unr 


mr(Ky, A[Z]). 


unr 


Consequently, L, = Li. 

Finally, let v be a place of K dividing ¢. Let E/Z, and A/Z, be the Néron models 
of E/Q, and A/Q,, respectively. Since £ is prime to Np, these models are abelian 
schemes. Furthermore, the base changes €/O, and A/O, are the Néron models 
of E/K, and A/K,, respectively. The R/J-vector schemes R/I @z E[€]/Ze and 
A[I]/Ze are finite, free models of A[/]/Q,. Since £ > 2, by Theorem 3.3.3 of 
[Ray74], the identification of their generic-fiber Galois modules extends uniquely 
to an isomorphism R/J ® E[€] = A[J] over Ze. By Lemma 7, we have 


Ly = R/I @ H,(Spec(O,), E[é]) and L’ = H4(Spec(O,), A[/]). 


Therefore, L, = L’. 


5 Proof of Theorem 2: local condition at vy = p 
and the parity lemma 


5.1 The conditions at v = p 


The Gal(K,/K,)-module E[é](K,) is unramified at p, since p is prime to the 
conductor N of E/Q. By assumption (c), the eigenvalues of Frob,2 on E [€](Kp) 
are | and p”. By assumption (f), we have p? # 1 (mod £), and so E[€](K,) splits 
as a sum of the Frob,,2-eigenspaces for the eigenvalues 1 and p’. Therefore, we have 
an isomorphism of R/J-vector schemes (or (R/J)[Gal(K/K ,)]-modules): 


AlI]/Kp = R/I ® R/I(A), 


where R/J(1) is the R/J-vector scheme R/I @ Le. 
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Lemma 8. The spaces H'(K,, R/T) and H'(K,, R/I(1)) are each 1-dimensional 
over R/I. Furthermore, 


Lp =H'(Ky R/T) and L’, = H'(Ky, R/1(1)). 


Proof. We have H'(K,,R/I) = Hom(Gal(K,/K,), R/1). Since R/I is ab- 
stractly a sum of copies of Z/£Z and £ # p, any such homomorphism is 
tamely ramified. Furthermore, by assumption (f), any tamely ramified homomor- 
phism is unramified. Therefore all classes in H!(K pk /T) are unramified, and 
H!(K,, R/1) > R/T, sending a cohomology class to the image of Frob,2 in R/T, 
is an isomorphism. 

By Kummer theory, we have H!(K,, We) = ey and the unramified 
classes are Oe, )’. Thus by assumption (f), we have Hiw(Kp, He) = 0, and 
H!(K,p, 4e) is 1-dimensional over Z/£Z, generated by the class of a uniformizer in 
KON KY. Since E'/ K, has good reduction we have, by Lemma 6, 


Lp = R/I ® Hy, (Kp, E[€), 
and so we find that L,, is the summand H!(K,, R/J) of H'(K,, A[/]). 

To see that Li = H!(K,, R/I(1)), we will use the rigid-analytic uniformization 
of A/K,. Since it is an isogeny factor of the new quotient of Jo(Np)/Q, the 
abelian variety A/Q, has purely toric reduction. Let A/Z, be its Néron model. By 
Lemma 2, the F -vector space Q @z X*(A°/F,,) is 1-dimensional, and so the group 
Gal(F,/ F,) acts on it via a continuous F*-valued character. Since F is totally 
real, this character must be quadratic. The group scheme A°/F p2 is thus a split 
torus. 

Since A/Q, has semi-stable reduction, the (split) torus AVE ,2 is identity 
component of the special fiber of the Néron model of A/K,. Let T/K, be 
the split torus whose character group is X*(A°/F p2)- Let R act as endomor- 
phisms of T7/K, dual to the action of R on X*(A°/F 2). One then has the 
rigid-analytic uniformization T*" —> A over K, (cf. [BL91]), which yields 
a surjection of R[Gal(K,/K,)]-modules T(K,) — A(K,). The kernel A of 
TUR) => A(K,) is a free Z-module of rank dim(A), on which Gal(K, / K,) acts 
trivially. 

Consider the following diagram comparing the Kummer maps for T/K, and 
A/K>, where the vertical maps come from the analytic uniformization T (Kp) > 
A(K,): 

I~ @rT(K,) —— H(K,, TI) 


| | (5.1) 
I~! @g A(Kp) ——> H!(K>, Al). 


Since A[/] = 0, the uniformization induces an injection T [J] — A[J] over Kp. 
As X*(A°/F p2) is locally free of rank 1 over R, the R/J-module scheme 7T[/] 
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is abstractly R/J(1). Thus the image of T[/] in A[/] is the R/J(1) summand of 
All] = R/I ® R/I(1). Consequently, the image of the right vertical arrow in (5.1) 
is the summand H!(K,, R/1(1)) of H'(K>, A[/]). The cokernel of the top arrow is 
contained in H!(K p» 1’), which vanishes by Hilbert Theorem 90, since T/K pisa 
split torus. Finally, the left vertical arrow is surjective, since H'(K,, A) = 0. These 
three observations imply that the image Li of the Kummer map in H!(K,, A[/]) is 


the summand H!(K,, R/J(1)). oO 


5.2. Comparing the Selmer groups 


To prove Theorem 2, we compare the subspaces R/1 @Sel(E/K, €) and Sel(A/K, J) 
of H!(K, A[/]). Let Sels(A/K,I) C Sel(A/K,1) (with s for “strict’”) be the 
intersection of these two Selmer groups. By Lemmas 5 and 8, the Selmer group 
Sel,(A/K, ) is the space of classes c € H!(K, A[/]) such that for v # p the 
local restriction c, belongs to Ly = L’ and such that the local restriction x, is 0. 
Since the spaces L, and Ly are both 1-dimensional over R/J, the codimension 
of Sel,(A/K,/) in each of R/I @ Sel(E/K, £) and Sel(A/K, /) is at most 1. 
More precisely, by Theorem 1, the Selmer group Sel,(A/K, /) is 1-dimensional, 
if the Heegner point P is divisible by € in E(K,): in this case the restriction to 
p of the image of P in Sel(E/K,£) vanishes, and so R/I ® Sel(E/K,£) = 
Sel,(A/K, /). Therefore, if P is divisible by ¢ in E(K,), the R/J-dimension of 
Sel(A/K, J) is either 1 or 2. Similarly, if P is not divisible by ¢ in E(K,), then 
Sel,(A/K, /) is 0-dimensional; in this case, the R/J-dimension of Sel(A/K, /) is 
either 0 or 1. 

As explained in §6 below, the conjecture of Birch and Swinnerton-Dyer suggests 
that the R/J-dimension of Sel(A/K, /) should be even. In order to finish the proof 
of Theorem 2, we must exclude the possibility that Sel(A/K, /) is 1-dimensional 
over R/J, in harmony with the conjecture. In order rule out the 1-dimensional case, 
we present in §5.3 a variant of an argument which was shown to us by Benjamin 
Howard. 


5.3 The parity lemma 


Let k be a finite field of characteristic £ and let M be a totally imaginary number 
field. In what follows, we will take k = R/I and M = K.Let V be locally constant 
constructible étale sheaf of k-vector spaces over Spec(M ), equipped with a perfect, 
alternating, k-bilinear pairing V x V — k(1). In the application to Theorem 2, we 
will take V = A[J/] and use the pairing coming from the Weil pairing on E[€]. For 
each finite place v of M one then has the perfect, symmetric, k-bilinear Tate-local- 
duality pairing H'(M,, V)xH!(M,, V) > k mapping x x y (x, y),. The duality 
pairing is the composition of the product H!(M,, V) x H'(M,, V) > H?(M,, k(1)) 
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with the reciprocity isomorphism H?(M,,k(1)) — k. For each finite place v of 
M,let A, Cc H'(M,, V) be a k-subspace. Assume that for almost all v, one has 
A, = HL. (M,, V). Assume furthermore that each A, is its own annihilator under 
the Tate pairing. This assumption implies that each H'(M,, V) has even dimension 
over & and that each A, is a totally isotropic subspace of half the dimension of 
H!(M,, V). Thus the Tate pairing on H!(M,, V) is a split symmetric bilinear form, 
and A, is a maximal totally isotropic subspace. 

We distinguish one fixed finite place w of M, which will be the place p of K in 
the proof of Theorem 2. Assume that H!(M,,, V) is 2-dimensional over k, so that 
H!(M,,, V) equipped with the Tate pairing is a hyperbolic plane. There are then 
exactly two maximal totally isotropic subspaces (lines), namely, the given A,, and 
another subspace A’. (Recall that k has characteristic £ # 2.) 

We consider four Selmer groups contained in H!(M,V), defined by the local 
conditions A, for v 4 w and differing only in their defining local conditions at w. 
Let Sel,,(V) (“unramified’’) be defined by the local conditions A, at all places v, i.e., 
Sel,(V) C H!(M, V) is the space of classes x such that the restriction x, belongs 
to A, for all finite places v of M. Let Sel,(V) (“transverse”) be defined by the local 
conditions A, at v # w and by Af, at w. Let Sel,(V) (“relaxed”) be defined by the 
local conditions A, at v 4 w and no condition at w. Let Sel,(V) (“strict”) be defined 
by the local conditions L, at v  w and local vanishing at w. 


Lemma 9. The k-dimensions of Sel,(V) and Sel, (V) differ by exactly 1; moreover, 
either 

Sel, (V) = Sel-(V) and Sel;(V) = Sel;(V), or 

Sel,(V) = Sels(V) and _— Sel;(V) = Sel, (V). 


Proof. Let x, y € Sel,(V). By global class field theory, one has 


ee Vy)» = 0, 


v 


where x,, vy, € H'(M,, V) are the restrictions of x and y. Since x, y restrict to 
elements of the totally isotropic spaces A, for v 4 w, one has 


(Xws Yw)w = is Vy) y =0. 


vy 


Therefore, the image of Sel, (V) in H'(M,,, V) under the restriction map is a totally 
isotropic subspace. Consequently this image is contained in A,, or A/,, and 


Sel,(V) = Sel,(V) or  Sel,(V) = Sel; (V). 


On the other hand, it follows from the global Euler characteristic formula and 
global duality (cf. [DDT94], Theorem 2.19) that the k-codimension of Sel,(V) 
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in Sel,(V) is 1: the local conditions for Sel;(V) are obtained by relaxing the 
local condition at w from the self-dual, 1-dimensional A,, to the 2-dimensional 
H'(M,,, V), and the dual of the relaxed condition at w is the strict condition at 
w defining Sel,;(V). As Sel;(V) = Sel, (V) M Sel, (V) in Sel, (V), one cannot have 
Sel,(V) = Sel;(V) = Sel,(V). Therefore, one has either Sel,(V) = Sel,(V) 
and Sel,(V) = Sel,;(V) or Sel}(V) = Sel;(V) and Sel,(V) = Sel,(V). These 
relations and the fact that the k-codimension of Sel,(V) in Sel,(V) is 1 prove the 
lemma. oO 


5.4 Completion of the proof of Theorem 2 


To finish the proof the theorem, we apply the parity lemma to A[/]/K to compare 
the subspaces R/I @® Sel(E/K, £) and Sel(A/K, I) of H!(K, A[/]). The transfer 
of the Weil pairing on R/J ® E[€] via the isomorphism R/J @ E[t] = A[T] 
to A[/] provides a perfect, alternating pairing A[/] x A[/] — R/J(1). The fact 
that the spaces L, C H!(K,, A[/]) are their own annihilators under the Tate 
pairing follows from Tate local duality for the elliptic curve E/K. Alternatively, 
one can deduce it directly from the description L, = H1,,(K,, A[/]) for v prime 
to £ and Ly = Hj4(Spec(O,, A[/]) for v dividing £ (cf. [Mil86], Theorem 1.2.6, 
Corollary I.1.10(b), and Theorem II.1.8(b)). As we observed above in Lemma 8, 
conditions (c) and (f) imply that A[J]/K, = R/I ® R/I(1) and that H!(K,, A[/]) 
is 2-dimensional over R/J. Each 1-dimensional summand R/J and R/J(1) is 
isotropic for the (alternating) Weil pairing, and so the 1-dimensional summands 
H!(K,, R/1) and H'(K,, R/I(1)) of H'(K,, A[/]) are isotropic for the Tate 
pairing. 

We now apply the parity lemma with M = K, the étale sheaf V = A[/], 
equipped with the Weil pairing, and the local conditions A, = L,. These structures 
satisfy the hypotheses of §5.3. We take the distinguished place w to be the place p of 
K. By the discussion in §5.1, we have A‘, = L’,. Since forv # p, we have L, = Ly, 
the parity lemma compares the Selmer groups Sel,(A[/]) = R/I @ Sel(E/K, £) 
and Sel,(A[7]) = Sel(A/K,/) in H!(K, A[/]). By Kolyvagin’s result stated in 
Theorem |, we know that Sel(£/K, £) is 1-dimensional over Z/€Z, generated by 
the image of the Heegner point P under the Kummer map. Therefore, the parity 
lemma implies Theorem 2. Oo 


6 Compatibility with the functional equation 


In this section we study the signs of the functional equations for L-functions 
related to f and g. Let ¢€ = +1 be the sign of the complete L-function of 
f, ie. if A(s, f) = (27) *T(s)L(s, f) is the complete L-function of f, then 
A(s, f) = €N'*A(2 —s, f). If W is the Fricke involution of level N, then 
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Wf =-—ef by Theorem 3.66 of [Shi94]. Let y be the quadratic Dirichlet character 
corresponding to K/Q. By [Shi94], Theorem 3.66 and Lemma 3.63 (2), the sign of 
the complete twisted L-function A(s, f, x) = (27) “*T(s)L(s, f. x) is x(-N) xe. 
That is, 


A(s, AX) = (XN) x (PN) FAQ—S, £9), 


where r is the conductor of x. Since the primes dividing N split in K and K is 
imaginary quadratic, one has y(—N) = y(—1) = —1, and so the sign of A(f,s, x) 
is —e. Therefore, the sign of A(s, f) x A(s, f, x), which is the complete L-function 
of the base change of f to K, is —1. By the compatibility of the Eichler-Shimura 
construction with the local Langlands correspondence (cf. [Car86]), the complete 
L-function A(s, E/K) = A(s, E/Q) x A(s, E/Q, x) is A(s, f) x A(s, ff x). Thus 
the functional equation for A(s, E/ K) has sign —1, which is what one would expect 
from the Birch and Swinnerton-Dyer conjecture and Theorem 1, since E [£](K) = 0 
by condition (b). 

Recall that the new level of the newform g is Np by condition (e). Let €’ be the 
sign of the functional equation of A(s, g). Then, as in the case of f, the sign of the 
functional equation of A(s, g, 7) is x(—Np) x €’ = e’, since the primes dividing N 
split in K, the field K is imaginary quadratic, and p is inert in K /Q. Consequently, 
the sign of A(s, g)x A(s, g, x), which is the complete L-function of the base change 
of gto K,is +1. 

The Birch and Swinnerton-Dyer conjecture for 4/K, when combined with the 
conjecture of Deligne and Gross [D79, Conj 2.7], predicts that the dimension of 
of the F-vector space F @r A(K) is equal to the order of vanishing of L(s, g) x 
L(s,g, x) at s = 1. From the sign of the functional equation, one thus expects 
F @pr A(K) to have even dimension. Theorem 2 implies that the dimension is 0, if £ 
does not divide P in E(K,), since 1~!/R ®z A(K) injects into Sel((A,i)/K, 1) = 
0. If £ does divide P in E(K,), then by Theorem 2, the group Sel((A,7)/K, /) 
is 2-dimensional over R/J. If II1(A/K) is finite, then one can check using the 
Cassels—Tate pairing that the dimensions of Sel(A/K,/) and of I~!/R @pr A(K) 
have the same parity; thus F ®pr A(K) has dimension 2 or 0, according to whether 
Ill(A/K)[/] = 0 or not. 

Note that if £ does not divide P in E(K,), then, since the dimension of F ®r 
A(K) is 0, one finds that the rank of A/Q is 0. The following lemma shows that this 
conclusion is compatible with the parity prediction of the Birch and Swinnerton- 
Dyer conjecture over Q. 


Lemma 10. Suppose that € does not divide P in E(K,). Then the sign €' of the 
functional equation of A(s, g) is +1. 


Proof. Owing to the congruence between f and g, we have 


ap(f) = 4p(g)(p + 1) (mod A), (6.1) 
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as one sees by comparing the local Galois representations at p associated to f 
and g. According to Lemma 1, the congruence in (6.1) holds with a,(g) replaced 
by —e. Since p + 1 is invertible modulo ¢ by assumption f), we have, therefore, 
ay(g) = —€ (mod A). As ap(g) = +1 and ¢ F 2, we conclude that ap,(g) = —e. 

Let a be the root in Qy of the Hecke polynomial T? — a,(f)T + p that is 
congruent to —e modulo A, and let 6 be the other root. Consider the oldform with 
coefficients in Ze 


h(z) = f(z) — BF(p2) 


in the old space for f at level Np. The form /(z) is an eigenfunction of the Hecke 
operators T, for g prime to Np and of U, for g dividing Np; the eigenvalues at q 
prime to p agree with those of f, and at p, we have U,h = ah. Since a = —e 
(mod A), the reductions ¥,h € S2(Io(Np), F:) are eigenfunctions for the Hecke 
operators T, for g prime to Np and U, for g dividing Np with the same eigenvalues. 
Since both of these cuspidal eigenforms have a; = 1, we have = h by the q- 
expansion principle. 

For a prime g dividing N, let W, be the Atkin—Lehner involution at q for level 
N. Since f is a newform with new level N, it is an eigenfunction of all of the W,. 
Let W, f = €, f. Then the sign € is —[ [jy €q. 

For each q dividing Np, let W, be the Atkin—Lehner involution at g at level Np. 
Since g is a newform at level Np, it is an eigenfunction for all of the We One has 
W), = —U,, and so W,g = eg. For q dividing N, we have Wjh = e,h, and so 
Wig — Wih = €,8. Therefore, Wig = €,g for q dividing N. The sign e’ of the 
functional equation for A(s, g) is thus 


a, De 
~ex|[[e, = =+1. oO 
q|N 


7 Examples 


We now give two examples. The data consist of a 4-tuple (£, K, p, £) satisfying 
hypothesis (a)—(f). The examples are chosen so that the lifted form g has rational 
Fourier coefficients. In this case, we have Q(g) = Qand R = Z,; the abelian variety 
(A, i) is an elliptic curve, and the /-descent on (A, i) is simply the €-descent A. We 
wish to thank Noam Elkies and William Stein for help with the computations. 

The first example is: 


E = X,(57)/(W, Wi) 3=N =57 
K = Q(V-59) h=3 
p=2 az = —2 
£=5 
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The minimal equation of F is (cf. [BK75]): 

yy =x? — 22x42 A = —37.19, 
and the modular form f associated to E has q-expansion 

f =Ga0g =_ £29 =39" + 29° = Sq" # ccs 
The sign in the functional equation of L(s, E) = L(s, f) is € = —1, and E(Q) is 
free of rank 1, with generator e = (2, 1). The Heegner point P associated to K is 
equal to +2e. This is not divisible by € = 5 in E(K,) = E(K2). 
There is a unique newform g of weight 2 and level 114 = Np where Wj = 

W/, = +1 and Wy = « = —1. It has g-expansion 

g=qt+¢-Gt+qi+2q-q°+0q'+... 


congruent (mod 5) to the old form f(t) —2f(2rT). The elliptic curve A has minimal 
equation 


yrtxyty =x? 4 x7-—352x—2431 A = 233.19, 


The Selmer group Sel(A/K, 5) = 0, and A(K) has rank 0. 
The second example is 


E = X,(26)/(Wo) N =26 
K = Q(V—79) h=5 


p=3 az = 1 


The minimal equation of F is (cf. [BK75]): 
ytxyty=x—5x-8 A=-2P17 
and the modular form f associated to E has g-expansion beginning 
Fag aG Fe EP 8G Hag! hes 
The sign in the functional equation of L(s, E) = L(s, f) ise = +1, and E(Q) = 
Z/3Z. 


The Heegner point P associated to K has x-coordinate x(P) = 1700/711, and 
generates the free group E(K)/torsion. On the other hand, since the denominator 
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of x(P) is divisible by p = 3, the point P reduces to the identity (mod 3) and is 
divisible by £ = 5 in E(K3). We note that E has three points over the field with 
p = 3 elements, and fifteen points over the field with p? = 9 elements. 

In this case, g is the unique newform of weight 2 and level 78 = Np. It has 
Fourier expansion beginning 


g=q-G—-G+get2Pt+qo+4q' +... 


and is congruent to the old form f(t) + 3f(3t) (mod 5). The elliptic curve A has 
minimal equation 


yr txy =x? +x*— 19x + 685 A = —2!°,3°.13 


and rank 0 over Q. Since P is locally divisible by 5 in E(K3), the Selmer group 
Sel(A/K, 5) has dimension 2 over Z/5Z. In fact, A(K) has rank 2. In the twisted 
model over Q: 


yy +xy ty =x? — 121830x — 341716424 
A = —2!© 3°.13.79° 


the points 
(x, y) = (2732, 139056) 
(x, y) = (410357, —263076219) 
generate the Mordell—Weil group modulo torsion. 


Appendix A. Descent with endomorphisms 


In this appendix, we give an expanded discussion of the /-descent used on the 
abelian variety A/K and its Néron model above. In order to have results general 
enough to apply easily to group schemes such as the Néron model, we begin very 
generally with some abstract constructions on sheaves. We then specialize to the 
sheaves represented by smooth group schemes, explaining how to apply the abstract 
results to such cases. 


A.l_ First descent with endomorphisms 


Let R be a commutative, unital ring, and let J C R be an ideal. Let A be a sheaf 
of R-modules on some site S, and assume that A is /-injective in the sense that the 
map of sheaves 


A = Hom,(R, A) > Hom,(J, A) 
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is surjective. (Note that for any R-module M, the presheaf T +> Homar(M, A(T)) 
is a sheaf.) The kernel of this map is Hom,(R/J, A). Associating to a sheaf 
homomorphism R/J — A the image of the Section 1 identifies Hom,(R/J, A) 
with A[/], the sheaf of sections of A killed by elements of J. We thus have an exact 
sequence of sheaves 


O—— + All] > A ———» Hom, A) ——— > 0. 


The boundary map 
Hompy(J, A(S)) > H!(S, A[/]) 


in the associated long exact sequence in cohomology is the Kummer map. From 
the long exact sequence, we see that the kernel of the Kummer map is the image 
of Homr(R, A(S)) = A(S) in Homer(J, A(S)). Similarly, the cokernel of the 
Kummer map is the kernel of 


H'(S, A) > H'(S,Hom,(I, A)). 
If J is invertible as an R-module, then we have 
Hom,(I, A) = 17! @p A, 
where J~' = Homa(J, R). Here I~! @p A denotes the presheaf 
Tt>I7!' @r A(T), 


which is a sheaf since /~! is a flat R-module. Thus we have the exact sequence of 
sheaves 


0 —— All] —— A —— I! @ep A —— 0. 
The kernel of the Kummer map 
I~! @p A(S) > H'(S, A[Z]) 


is the image of R@p A(S) = A(S), and so the Kummer image is (J~!/R)@p A(S), 
or, equivalently, it is 7~! @r (R/I @r A(S)). In this case, we also have 


H'(S,17! @g A) = 17! @g H'(S, A). 
The cokernel of the Kummer map is then the kernel of 
H!(S, A) > I~! @r H'(S, A), 


which is H!(S, A)[/], the /-torsion of the R-module H!(S, A). 
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If S is the small étale site of Spec(K) for a number field K, we define 
Sel(A/K, J), the /-Selmer group of A, to be the subspace of classes x € 
H!(K, A[J]) whose local restrictions x, € H!(K,, A[/]) lie in the image of 
Homar(/, A(K,)) under the local Kummer map for all places v of K. As in the 
standard case with R = Zand J = (£), the global Kummer map 


Homp(I, A(K)) > H!(K, A[/]) 


factors through Sel(A/K, /). 

In topology the most familiar analogue of the above formalism is the following: 
let M be an R-module that has no J-torsion, so that M — Homa(J, M) is injective. 
Let N be the cokernel, so that we have the exact sequence of constant sheaves of 
R-modules on any topological space X 


0 ——> M ——> Hom,(/,M) ——~> N ——> 0. 
Passing to cohomology, we find 
—— H(X,M) ——} H'(X,Hom,(/,M)) ——> 
H'(X,N) ——> H't!(x, M) ——> 
which is a Bockstein-type sequence in the case R = Z and I = (£). 
To recast the above discussion in this style, we replace the sheaves A and 
Hom, (J, A) with the complexes 


M=Al[-1] and M'=Homa(J, A)[-I], 


concentrated in degree 1. The /-injectivity of A translates into something like 
7-torsion freeness of M: the exact sequence of sheaves 


0 —— All] ——~> A —— Hom,(/, A) —— 0 
gives rise to a distinguished triangle 
M > M' = A{l] > M{I]. 


The sheaf A[/] thus plays the same role as the cokernel N in the picture over a 
topological space X , and the Kummer map is 


Hom(/, A(S)) = H!(S, M’) > H'(S, A[/]). 
For a concrete example, let S be the fppf site of a field K, and let A is the 


sheaf represented by an abelian variety over K on which R acts as endomorphisms. 
Assume that J is invertible as an R-module. Then, as explained in the next section, 
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Hom,(/, A) = I7~' @p A is also represented by an abelian variety, on which R 
acts as endomorphisms. Furthermore, the natural map A > I~! @p A of sheaves is 
surjective. The complexes M and M’ are the 1-motives associated to these abelian 
varieties, and the surjection of sheaves A — Homp(J, A) becomes an injection of 
motives M — M’. The sheaf A[/] (in degree 0) stands in for the (hypothetical) 
torsion motive coker(M — M’). 


A.2 Smooth R-module schemes 


Let R and J be as in §A.1. Assume moreover that J is finitely presented as an R- 
module. Let X be a scheme and let G/X be an R-module scheme. The choice of 
a presentation of J produces a scheme representing the fppf sheaf Hom,(/, G): let 
R®" > R®’ —» I + 0 be an R-module presentation. Then the kernel of G®” > 
G®* represents Homp(I,G). We fix a presentation of J and write Hom,(/, G) 
for the representing scheme as well as the sheaf. Many scheme-theoretic properties 
of G/X carry over to Hom, (/, G). For instance, if G/X is separated (resp. quasi- 
compact, quasi-separated, locally of finite type, locally of finite presentation, proper, 
...), then so is Homp(/, G)/X. Note finally that if J is invertible as an R-module 
and G/X has connected geometric fibers, then so does Hom, (J, G): the surjection 
R®’ — J splits, and so Hom,(/, G)/X is a factor of G®’ / X, which has connected 
geometric fibers. 

Assume from now on that G/ X is smooth and that J is invertible as an R-module. 
It follows from the functorial criterion for smoothness that Hom, (J, G) is smooth 
over R. Furthermore, there is a natural isomorphism Hom,(/,Lie(G/X)) = 
Lie(Hom,(/, G)/X), and so the relative dimensions of G/X and Hom,(/, G)/X 
agree. 

The kernel and cokernel of the map on Lie algebras 


Lie(G/X) > Lie(I~! @p G/X) = I7! @p Lie(G/X) 


are coherent sheaves on X on which R acts and that are annihilated by /. Thus 
if 7 contains the image of € € Z such that ¢ is invertible on X, this map is an 
isomorphism, and G > I~! @ x G is étale. By [SGA70], Exposé VIB, Proposition 
3.11, if G/X has connected geometric fibers, the map G — Hom,(J/, G) is étale 
and surjective, and so the sheaf on the small étale (or fppf or fpqf) site of X 
represented by G/X is /-injective. 

Alternatively, assume that G/X has connected and semi-abelian geometric 
fibers, and that J contains the image of any non-zero £ € Z. The multiplication- 
by-€ endomorphism of G is then surjective. Since G — I7! @p G factors 
through multiplication by €, the geometric fibers of I~! @r G/X are connected, 
and the relative dimensions of G/X and I~! @r G/X agree, the homomorphism 
G — I! @p G is also surjective. By [SGA70], Exposé VIB, Proposition 3.11, the 
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homomorphism G + I~! @ x G is then flat and surjective. Consequently, the sheaf 
on the fppf (or fpqf) site of X represented by G/X is I -injective. 

In either of these cases, since G > I~! @p G is flat, the kernel G[J]/X is 
flat and quasi-finite for dimensional reasons. If G/X is proper, then G[/]/X is 
moreover finite and locally free. 
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Uniform estimates for primitive divisors in 
elliptic divisibility sequences 
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Abstract Let P be a nontorsion rational point on an elliptic curve EF, given by 
a minimal Weierstrass equation, and write the first coordinate of nP as A, /D?, 
a fraction in lowest terms. The sequence of values D,, is the elliptic divisibility 
sequence (EDS) associated to P. A prime p is a primitive divisor of D, if p divides 
D,,, and p does not divide any earlier term in the sequence. The Zsigmondy set for 
P is the set of n such that D,, has no primitive divisors. It is known that Z is finite. 
In the first part of the paper we prove various uniform bounds for the size of the 
Zsigmondy set, including (1) if the j-invariant of E is integral, then the size of the 
Zsigmondy set is bounded independently of E and P, and (2) if the abc Conjecture 
is true, then the size of the Zsigmondy set is bounded independently of F and P for 
all curves and points. In the second part of the paper, we derive upper bounds for 
the maximum element in the Zsigmondy set for points on twists of a fixed elliptic 
curve. 
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1 Introduction 


Let E/Q be an elliptic curve given by a Weierstrass equation 


y? tayxy +azy = x? + agx* +.a4x +46, a1,...,46EZ (1) 


and let P € E(Q) be a point of infinite order. For each n > 1 the n" iterate of P 
has the form 


where the fractions are written in lowest terms and we assume that D, > 0. The 
sequence Dg p = (Dy,)n>1 is called the elliptic divisibility sequence associated 
to E and P. It is a divisibility sequence in the sense that if m|n, then D,,|D,, and 
in fact it satisfies the stronger divisibility relation 


Decan,n) = ZCU(Din, Dn) for allm,n > 1. 


(See Section 1.1 for a general statement over Dedekind domains.) 

The study of the arithmetic properties of elliptic divisibility sequences was 
initiated by Morgan Ward in the 1940’s [34, 35] and has seen a surge of interest 
in recent years, see for example [1, 2,5, 6, 8-13, 18, 23, 31-33]. 

IfC = (C,,)n>1 is any divisibility sequence, one says that a prime p is a primitive 
divisor of C, if p|C, but p + C,Cz---C,—1. Primitive divisors of certain divisibility 
sequences were studied by Zsigmondy [37] in the 19" century. We define the 
Zsigmondy set of a divisibility sequence C to be 


Z(C) = {n => 1: C, does not have a primitive divisor}. 


Zsigmondy was especially interested in divisibility sequences defined by binary 
linear recurrences satisfying appropriate growth conditions. Bilu, Hanrot, and 
Voutier [4] recently completed the proof that all of these Lucas and Lehmer 
sequences satisfy max Z(C) < 30, and there are examples to show that this bound 
is sharp. They also completely describe all such sequences with max Z(C) > 12. 
We note that Lucas divisibility sequences are associated to singular elliptic curves, 
so the material in this paper is, in some sense, a direct generalization of these 
earlier results. 

It is a nontrivial fact that the Zsigmondy set Z(Dz,p) of an elliptic divisibility 
sequence is finite, see [29]. It is natural to ask if there is a uniform bound 
for Z(D¢,p) as there is for the case of binary linear recurrences. The answer is no 
unless some care is taken, since a simple change of variables (x, y) +> (u?x,u>y) 
allows one to multiply every term of the sequence by a power of u. This is the 
same trick that allows the creation of elliptic curves with arbitrarily many integer 
points, and the solution to both problems is the same, namely restrict attention to 
minimal Weierstrass equations. With this restriction, we prove a reasonably strong 
uniform bound for the number of elements in the set Z(Dz,p), and assuming the 
abc-conjecture, we show that #Z(D¢_p) is bounded independently of EF and P. 
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Theorem 1. Let E/Q be an elliptic curve given by a minimal Weierstrass equa- 
tion (1), let P € E(Q) be a point of infinite order, and let Dg,p be the associated 
elliptic divisibility sequence. 


(a) #Z(De,p) is bounded by a constant depending only on the number of primes at 
which E has split multiplicative reduction. 

(b) If the abc-conjecture over Q is true, then there is an absolute bound for 
#Z(De,p) that is completely independent of E and P. 


Remark 1. Primes of split multiplicative reduction necessarily divide the denom- 
inator of the j-invariant, so a slightly weaker version of (a) is that #Z(D¢p) 
is bounded by a constant depending only on the number of primes dividing the 
denominator of j(£). So, for example, it is unconditionally true that there is an 
absolute bound for #Z(D_z.p) as E varies over all elliptic curves with integral 7- 
invariant. 

For (b), we prove an unconditional theorem that implies the stated result. We 
show that #Z(D¢_p) is bounded by a constant depending only on the Szpiro ratio 
of E'/Q defined by 


log | Discriminant F/Q| 
log | Conductor E/Q| © 


Szpiro Ratio(E/Q) = 


It is well known that the (weak) abc-conjecture implies that the Szpiro ratio 
is bounded independently of £. More precisely, Szpiro has conjectured that for 
any € > 0 there are only finitely many elliptic curves E'/Q whose Szpiro ratio 
exceeds 6 + €. See [16] for a discussion. 


Remark 2. We actually prove a general version of Theorem | over number fields. 
See Theorem 7 in Section 1.3 for the exact statement. 


Theorem | gives uniform bounds for the size of the Zsigmondy set Z(Dz,p), but 
it does not provide an effective bound for the largest element. Such upper bounds 
are not known in general for elliptic divisibility sequences, but various partial results 
are known. For illustrative purposes, we quote a result due to the first author. 


Theorem 2. (Ingram [18]) Let N be a squarefree integer, let E be the elliptic curve 
y? = x3 — N*x, and let P € E(Q) be a point of infinite order. Then 


Z(De.p)M2Z C {2} and Z(De.p)15Z= @. 


Further, if P € 2E(Q) or if x(P) <0, then Z(Dg.p) C {1, 2}. 


The main contribution of [18] is to provide, for fixed n > 3, an effective method 
for finding all elliptic divisibility sequences Dp arising from curves of the above 
form, such that € Z(D¢,p). The problem of finding all such sequences is reduced 
to that of solving a certain Thue—Mahler equation involving the binary form 


[[@-xey), 
Q 
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where Q ranges over points on E (Q) of exact order n. Note that this is entirely 
analogous to the method used in [4], originally outlined by Schinzel [21], wherein 
the problem of finding all Lucas sequences C such that n € Z(C) is reduced 
to solving a Thue-Mahler equation involving the mth cyclotomic polynomial. 
Unfortunately, the analogy to [4] does not provide a bound on max Z(Dz__p), but 
bounds produced by ad hoc methods in [10] can be reduced, using the above 
observation, to those presented in Theorem 2. 

This idea rests on the observation that the points of order on E vary predictably 
as E runs over a family of quadratic twists of a fixed curve. Thus, one may show 
that, for fixed n > 3, the collection of elliptic divisibility sequences Dg p such that 
n € Z(Dz.p), where EF runs over the quadratic twists of any fixed elliptic curve, is 
finite and effectively computable. In Section 1.4, we present a result to this effect 
over number fields. Note that the proof can easily be modified to treat families of 
curves defined by quartic or sextic twisting, as in [18]. 

The methods used in [10] to obtain bounds on the largest element of Z(D <p) 
 2Z make critical use of the existence of rational points of order two on the 
curves in question, affording one a strong, explicit lower bound on the denominators 
of points that are divisible by two in the Mordell—Weil group. If one is willing 
to forsake the effective computability of these bounds, one may generalize these 
techniques. In the proof of theorem 6 we use Roth’s theorem on diophantine 
approximation to show that 


max(Z(Dz,p) M pZ) 


may be bounded for any sufficiently large prime p if EF ranges, again, over the 
quadratic twists of a fixed elliptic curve. 

Finally, in Section 1.5 we turn our attention back to elliptic divisibility sequences 
over Q with the aim of seeing what certain common conjectures tell us about 
max Z(Dg,p). We show that if Hall’s Conjecture is true, then Z(Dg.p) C 
{1,2,3,4} for all but finitely many elliptic divisibility sequences arising from 
(minimal) curves of the form E : y? = x? + M. Analogous results can be obtained 
for other families with fixed j-invariant if one accepts a generalization of Hall’s 
Conjecture due to Lang. Even with these generous assumptions, a uniform bound 
on Z(De_p) seems out of reach. 


Acknowledgements The first author’s research supported in part by a grant from NSERC of 
Canada. The second author’s research supported by NSA grant H98230-04-1-0064. 


1.1 Elliptic divisibility sequences over Dedekind domains 


In this section we prove some basic theorems concerning elliptic divisibility 
sequences over characteristic 0 Dedekind domains R. Let K be the fraction field 
of R and let E/K be an elliptic curve given by a Weierstrass equation (1) with 
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coefficients a; € R. For any nonzero point P = (xp, yp) € E(K) we define the 
denominator ideal of P to be the ideal Dp C R specified by 


1 
ord,(Dp) = 5 max{0, — ord, (xp)} for all 0 4 p © Spec(R). 


Here ord, : K* —» Z is the normalized valuation associated to p. The elliptic 
divisibility sequence (EDS) associated to a nontorsion point P € E(K) is the 
sequence of ideals 


Dep = (Dip )n>1- 
Proposition 1. Let Dz,p be an EDS as above and let p € Spec(R) be a nonzero 
prime ideal. Let p be the characteristic of R/p and let ey = ordy(p) be the 
ramification index of p. 
(a) ordy(Dgcaimnyp) = min{ord,(Dmp),ordy(Dnp)} for allm,n > 1. 
(b) Ifordy(Dnp) > &p/(p — 1), then 
ordy(Dgnp) = ordp(Dnp) + ord, (k) forallk > 1. 


Proof. \t suffices to prove the proposition after localizing and completing R and K 
at p. For each integer7 > 1 let 


E;(K) = {O € E(K): —ordy(xg) = 2i}. 
We also let Ey(K) = E(K), which is not quite standard notation, but suffices 
for our purposes. Then all of the E;(K) are subgroups of E,, see for example [27, 
Chap. IV]. We also observe that 
ord,(Do) = max{i > 0: Q € E;(K)}. 


Let m and n be positive integers and let d = gced(m,n). Write d = am + bn for 
some a,b € Z. Further let 


i = ordy(Dinp), j = ordy(Dnp), and k = ord, (Dap). 


Thus mP ¢€ £;(K) andnP ¢€ E;(K). The fact that the £;(K) are subgroups 
of E(K) allows us to conclude that 


dP =a(mP) + b(nP) € E;(K) + Ej (K) = Enmingij}(K). 


Hence 


k = ordy(Dap) > min{i, j}. 
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For the opposite inequality, we use the fact that d|m to conclude that 


m m 
mP = —-dP € —E,(K) C EX(K), 
d d 
soi = ordp(Dmp) = k. Similarly j > k, which completes the proof of (a). 
In order to prove (b), we use the fact that the subgroup EF (K) has the structure 
of a formal group and the E£;(K) form a filtration of subgroups of E,(K). Further, 
fori > e,/(p — 1), there are filtration compatible isomorphisms 


E;(K) — p", (2) 


where the group structure on p! is simply addition. (See [27, Chap. IV] for proofs 
of these basic facts.) 

Now suppose that ordy(Dnp) > ep/(p— 1). Then we can identify nP with some 
z € R satisfying ord,(z) = ordy(D,,p), and the formal group isomorphism (2) tells 
us that 


ordy(Dknp) = ordy(kz) = ordy(k) + ordy(z) = ordy(k) + ordp(Dnp). 


This completes the proof of (b). 


Definition 1. Let Dz,p be an elliptic divisbility sequence as above. The rank of 
apparition of Dz,p at the prime is the smallest integer rp = rp(£, P) with the 
property that p divides D,, p. 


Definition 2. We say that a prime p is exceptional for the elliptic divisibility 
sequence D p if 


(3 
d( Dp) <—_—, 
ord, ( es | 


We define a modified rank of apparition sy, by 


Sp =minjs => 1: ordy(Dsp) > ] i (3) 


Thus p is exceptional if and only if sy > ry. 


Remark 3. If p is exceptional, then necessarily 


K: 
1 < ordy(D,, p) < —2~ < [K:Q) 
pol” pel 


so p < [K : Q| +1. In particular, if K is a number field, then there are only finitely 
many exceptional primes. 


Remark 4. If the given Weierstrass equation for E has good reduction at p, then rp 
is the order of P in the group E(IF,). 


Proposition 2. Let K /Q be a number field of degree d and let Dg. p be an elliptic 
divisibility sequence. Suppose thatm € Z(Dz¢,p) is in the Zsigmondy set of De.p. 
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Then either m = Sy for some exceptional prime p or else 


logNx/oDmp < a logNx/gDep +d log(2d) + d log(m). 


k|m with 
m/k prime 


Proof. Let p be a prime dividing D,,p, let r = rp be the rank of apparition of p, and 
and let s = sy be the modified rank of apparition of p defined by (3). We consider 
several cases: 


Case 1.s +m. 
We know that r|m, so in particular we see that s # r. In other words, the prime p is 
exceptional. Using the strong divisibility property Proposition 1(a), we find that 


ord, (Dgca(sym)P) = min{ordy (Dsp), ordp(Dmp)}. 


The assumption that s + m implies that gcd(s,m) is strictly smaller than s, so the 
definition of the modified rank of apparition tells us that 


ordy (Deca(sm)P) < —e < ordy(D; P). 
Pp —* 


We conclude that 
ep 
pet 


ord, (Dm) = ord, (Deca(s,m)P) s 


Hence the product over all primes in Case | satisfies 


I] porte (Dne) I] [ [se 
p 


P 
ain pip 


[] pv? <2Ik:qQ. 


pS[K:Q\+1 


(For our purposes, it would suffice to know that the penultimate product is bounded 
by a constant depending only on [K : Q]. We do not actually need the sharp bound 
of 2[K : Q].) 


Case 2.5|m ands <m. 
In this case we can apply Proposition 1(b) to obtain the estimate 


ordy(Dmp) = ordy(Dsin/s)p) = Ordp(Dsp) + ord, (m/s). 


We also note that if s,|m with sy A m, then sp necessarily divides some divisor k 
of m having the property that m/k is prime. Hence the product over all primes 
satisfying Case 2 is bounded by 


I] pode Pm) I] ore m I] Dex, 
p 


k|m with P k|m with 
Spl m/k prime m/k prime 


Sp Am 


(In this last expression, the factor of m may be replaced by the squarefree part of m.) 
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Case 3.s =m. 
If p is exceptional, we have ruled this out by assumption. And if p is not exceptional, 
then s = r, and the assumption that m is in the Zsigmondy set implies that m 4 r. 
Hence Case (3) cannot occur. 

We now combine the three cases to estimate the norm of D,,p. To ease notation, 
we let d = [K : Q]. Then 


log Nx/joDmp = > ord, (Dnp) log Nx/op =e ~ ordy(Dinp) log Nxop 


peCase | peCase 2 
< d log(2d) + d log(m) + » logNx/oDxp- 
k|m with 
m/k prime 


This completes the proof of Proposition 2. 

We next prove an inequality relating the terms in an elliptic divisibility se- 
quence Dz p to the heights of the multiples of P. Roughly speaking, the quan- 
tity log Nx/qDq is the nonarchimedean contribution to the canonical height hz (Q), 


so we would expect log Nx/qgD og to be bounded by h E(Q). This is indeed true, and 
we can make the dependence on E explicit by using standard results relating naive 
heights to canonical heights. Thus the following result is comparatively elementary 
compared to Theorem 3 that we prove in Section 1.2. 


Proposition 3. Let E/K be an elliptic curve given by a Weierstrass equation (1), 
let P € E(K) be anontorsion point, and define the height of E to be 


h(E) =1+A(([l,a}7,a$, a3, aj, a)). 
Then 


ma q logNx/gDp < < hre(P) + O(A(E)), 


where the big-O constant is absolute. 


Proof. With appropriate normalizations on the absolute values in K, the absolute 
logarithmic Weil height of x(P) is 


h(x(P)) = > max{0, —v(x(P))}. 


veMxK 


If we sum over only the nonarchimedean places we obtain 
h(x(P)) = 2 max{0, —v(x(P))} = logNx/gDp. 


veMe 


Kal Q| 
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Finally, we use a uniform estimate for the difference between the Weil height and 
the canonical height 


A 1 
h(P) = sh(x(P)) + O(1 + A(E)), (4) 


see [7, 30, 36] for example, where the big-O constant is absolute. 


1.2. A uniform quantitative version of Siegel’s integrality theorem 
for elliptic curves 


A famous theorem of Siegel says that an elliptic curve has only finitely many integral 
points. Siegel actually proved something much stronger. For any point P € E(Q), 
we write x(P) = Ap/Di in lowest terms with Dp > 1 and we set 


1 
h(P) = 5 log max{|A p|, D>}: 


Then Siegel proved that 
log D3, 
li = 1. 5 
pek(Q) h(P) “ 
h(P)—>oo 


(See [27, IX.3.3].) Using the fact that h(nP) ~ nh(P), where /i is the canonical 

height on E, this shows that elliptic divisibility sequences over Q grow very rapidly, 
jit eee =A(P) > 0. (6) 
n—>oo n 

And indeed it is an easy exercise using Siegel’s deep result (6) and the elementary 

estimates given in Proposition | to prove that the Zsigmondy set Z(D,p) of an 

elliptic divisibility sequence is finite, see [29] or [27, Exercise 9.4]. 

Siegel’s proof of the finiteness of F(Z) can be used to give an upper bound 
for #£(Z), but the bound depends rather badly on the equation defining E. 
Dem’janenko in a special case and Lang in general [19] made the following 
conjecture. 


Conjecture 1. (Lang—Dem’janenko) Let E£/Q be an elliptic curve given by a 
minimal Weierstrass equation. Then #£(Z) is bounded by a constant that depends 
only on the rank of E(Q). 


As in Siegel’s work, rather than simply bounding the size of E(Z), one can ask 
for a uniform bound for the number of points in E(Q) that do not satisfy some 
inequality related to the limit (5). A bound of this sort was proven by the second 
author in [28], and in this section we apply the results from [28] to deduce uniform 
information about Zsigmondy sets of elliptic divisibility sequences. 
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However, we need to take some care, because an elliptic curve E/Q and a 
nontorsion point P € E(Q) do not completely determine an elliptic divisibility 
sequence. The reason is that the definition of the associated EDS uses a specific 
Weierstrass equation for E/Q. One solution is to take a global minimal Weierstrass 
equation, which works fine over Q, but unfortunately if K has class number larger 
than 1, then there exist elliptic curves E/K that do not have a global minimal 
Weierstrass equation [3, 26]. In this case one could work with a Néron model 
for E/ K, but instead we will simply put the dependence on the choice of Weierstrass 
equation into our notation. 

Let K be a number field and let Rx be its ring of integers. For a given 5-tuple 
of values a = (1, 42,03, 44,46) € Re, let E, denote the Weierstrass equation (1) 
with the given coefficients and define the height of FL, to be 


h(Ea) = 1+ h([1,a}?,a$, a3, a3, a@]). 


The canonical height hisa positive definite quadratic form on the Mordell—Weil 
group F,(K) modulo torsion, and we write 


\(E,/K) = min{h(Q) :Q€E,(K), O nontorsion}. 


In the language of the geometry of numbers, A(E,/K) is the first minimum of the 
quadratic form h on the lattice E a(K)/Ea(K )tors- 

We can now state the special case of [28] which we need to give a uniform bound 
for the size of the Zsigmondy set of an EDS. 


Theorem 3. With notation as above, for all € > 0 and all d > | there is a 
constant C = C(e,d) with the following property: Let K/Q be a number field 
of degree at most d, leta € R% be a 5-tuple so that Eg is an elliptic curve, 
let P € E,(K) be a nontorsion point, and let M > 1. Form the elliptic divisibility 
sequence (Dyp)n>1 as described in Section 1.1. Then the set 


kg 5 log Nxq(Dnp) < (1—€)n?h(P) + Mh(E,) 


has at most C \/Mh(E,q)/A(Ea/K) elements. 


Proof. This is a version of Theorem 4.1 in [28], see also [14] for similar results with 
explicit (albeit huge) constants. We briefly indicate how the results in [28] imply our 
statement. A direct application of [28, Theorem 4.1] to the family of Weierstrass 
equations E — P>° yields 


# P € E,(K): Ax,(0)(P.v) = €he,(P) — Mnce)} 


veS 


Mh(Ea) 5 rank Ea(K) 
) : (7) 


<#E(K ae C'+#S tank Ea(K) 
< #Eg(K tors T(E, /K) 
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(We refer the reader to [28] for a complete description of the notation. In particular, 
the constant C depends only on [K : Q] and €.) This is not quite what we want, 
since we are dealing with a rank-one torsion-free subgroup of E,(K), namely the 
subgroup generated by a particular point P € E,(K). However, the proof in [28] 
is easily adapted to subgroups of E(K), so in (7) we can replace E,(K) by the 
set {nP :n € Z}, which also means that we put #F4(K) tors = 1 and rank F4(K) = 
1. Further, we take S = Mf to be the set of archimedean places of K, so we can 
absorb the dependence on #S into the constant C. Then for P € E,(K) we have 
the estimate (for a new constant C = C([K : QJ, «)) 


rn Mh(E, 
Hn > 1: \° Ag «oy(aP.v) = che, (nP) — MnEs)} < Chae (8) 
veM 
The local height function Ag, (a) is given by 
Az,«0)(Q,¥) = 5 max{0, —v(x(Q))}, 


where the valuations are normalized so that the absolute logarithmic height of a € 
K* is given by the formula h(a) = 5°, max{0, —v(a)}. Hence 


1 
> A8,.(0)(Q.¥) = 5D) max{0, —v(x(Q))} 


me mee 


1 1 
= sh(x(Q)) — 5D maxt{0, —v(x)} 
Me 


1 
= 5h(x(Q)) - ma a logNxjq(Do) 


hp,(Q) + O(h(Ea)) — 


rata q log Nxja(Do). 


(For the last line we have again used (4). Note that the big-O constant is absolute.) 
Putting O = nP yields 


y, Ae,(0)(0P, v) = he,(nP) — 


Me 


a 0] log Nx/o(Dnp) + O(h(Ea)). 


Substituting this into (8) gives 


n> >1:(1—ehg,(nP) + (M — Ch(E,) = log Nija(Dar)} 


[K : Q] 


Mh(E,) 
=i © 
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where C’ > 0 is an absolute constant. Finally, we replace M by M + C’, use the 
inequality M+C’ < (1+C’)M, and increase the value of C accordingly. Then (9) 
and the canonical height property h (nP) = nh(P) give the desired result, which 
completes the proof of Theorem 3. 


1.3 A uniform Zsigmondy estimate 


We have now assembled all the tools needed to prove a uniform bound for the size 
of the Zsigmondy set of an elliptic divisibility sequence. 


Theorem 4. Continuing with the notation from Sections 1.2 and 1.1, leta € RX so 
that Ey is an elliptic curve and let P € E,(K) be a nontorsion point. Then there is 
a constant C = C([K : Q]) such that 


#Z(De,,p) < Ch(Ea)/A(Ea/K). 


Proof. As noted in Remark 3, the number of exceptional primes is bounded by a 

constant depending only on [K : QJ], so without loss of generality we may discard 

from Z(Dz,,p) all m with the property that 7 = s, for some exceptional prime p. 
Let m € Z(Dz,,.p). Then Theorem 3 (with e = ‘ and M = 1) says that with at 


most O(./h(E,)/A(Ea/K)) exceptions, we have 


Ko] G log Nx/q(Dmnp) = sm h(P), (10) 


In the other direction, Propositions 2 and 3 allow us to estimate 


Vaca 0] log Nx/oDmp 
1 

< > ——— logNx/qDxp + log(2[K : Q]) + Vm log(m) 
k|m,kAm [K : Q| 

< So (A(kP) + O(h(Es))) + log(2[K : Q]) + Vmlog(m) 
k|m,kAm 

> kPh(P) + O(mh(E,)). (11) 

k|m,k4#m 


We also have the trivial estimate 


2 k? <m? > a <6) - 1). (12) 


k|m,k#m k|m,k>2 
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Combining (10), (11) and (12) yields 
(2 —¢(2)) m7h(P) < O(mh(E,)). 
Since 7 — €(2) = 0.105 > 0, we find that 
m < O(h(Ea)/h(P)) < O(h(Eq)/A(Ea/K)). 
All of the big-O constants depend only on [K : QJ, so this completes the proof of 


Theorem 4. 


In order to apply Theorem 4, we need some sort of upper bound for the 
ratio h(E,)/A(E,/K). The denominator depends only on the K-isomorphism class 
of E,, while the numerator depends on the particular Weierstrass model. For 
example, if we let 


uxa= [l,ua1, war, uaz, u*a4, u°ae), 


then E..a is K-isomorphic to Eg, but it is not hard to see that h(E,sa) = A( Eg) + 
12h(u) + O(1). Thus in order to obtain a completely uniform upper bound in 
Theorem 4, we must put some restriction on the choice of a. 


Definition 3. Put a partial order on Ry by setting a < b if 
Eg =/K Eb and Nxjo Disc(E,4) < Nx/o Disc( Ey). 


A Weierstrass equation F, is called K-quasiminimal if a is a minimal element for 
this partial order. 


It is clear that every elliptic curve has a quasiminimal Weierstrass equation. The 
following is a natural generalization of a conjecture of Lang [19], which he made 
based on some preliminary work of Dem’janenko. 


Conjecture 2. (Lang) Let K /Q be a number field. There is a positive constant C = 
C(K) such that for all K-quasiminimal Weierstrass equations E, over K we have 


A(E,/K) z= Ch(E,). 


Clearly Lang’s Conjecture 2 combined with Theorem 4 imply that the size of 
the Zsigmondy set of an elliptic divisibility sequence on a K-quasiminimal elliptic 
curve is bounded by a constant depending only on K/Q. We mention two other 
conjectures that turn out to be related to Lang’s conjecture. 


Definition 4. The Szpiro ratio of an elliptic curve E/K is the quantity 


log Nx/g Disc(E/K) 


DE) 5 Nx/q Cond(E/K)’ 
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where Cond(£/K) is the conductor of E/K. (For our purposes, it would suffice 
to replace Cond(£/K) with the product of the primes at which E/K has bad 
reduction.) 


Conjecture 3. (Szpiro) For any € > 0 there are only finitely many elliptic 
curves E'/K satisfying o(E/K) > 6+ €. 


It is well known that the abc-conjecture of Masser and Osterlé implies Szpiro’s 
conjecture. Less obvious is the fact that Lang’s conjecture is a consequence of 
Szpiro’s conjecture. 


Theorem 5. (Hindry—Silverman [16]) Szpiro’s Conjecture 3 implies Lang’s Con- 
jecture 2. More precisely, there is a positive constant C = C(K) such that for all 
K-quasiminimal Weierstrass equations E, over K we have 


AE,/ K) = Chen): 


We also quote another partial result on Lang’s conjecture in which the constant 
depends in a mild way on the elliptic curve. 


Theorem 6. (Silverman [25]) With notation as above, let v(E,) be the number of 
primes of K at which Ey has split multiplicative reduction. Then 


A(E,/K) > Ct) p(k, ). 


Combining all of this material yields a number field version of the result 
(Theorem 1) stated in the introduction. 


Theorem 7. Let K/Q be a number field, let E/K be an elliptic curve given by a 
K-quasiminimal Weierstrass equation, let o(E/K) be the Szpiro ratio of E/ K, and 
let v(E/K) be the number of primes of K at which E/K has split multiplicative 
reduction. Let P € E(K) be a point of infinite order, let Dg.p be the associated 
elliptic divisibility sequence, and consider its Zsigmondy set Z(Dg,p). There is a 
constant C = C(K) depending only on K such that: 


(a) #Z(Dz,p) < CITN/®), 
(b) #Z(Dz,p) < C+0/®), 


If Szpiro’s conjecture or the abc-conjecture is true, then #Z(Dz.p) is bounded by a 
constant that depends only on K. 


Proof. The estimate in (a) follows by combining Theorems 4 and 6, and the estimate 
in (b) follows similarly by combining Theorems 4 and 5. The final statement is clear, 
since Szpiro’s conjecture says that o(£/ K) is bounded independently of £, and it 
is well known that the abc-conjecture implies Szpiro’s conjecture. (Note that we 
actually only need a weak version of either conjecture, i.e., it suffices to have any 
exponent, we do not need 6+¢ in Szpiro’s conjecture or 1+€ in the abc-conjecture.) 
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1.4 Results for quadratic twists 


Theorem 7 bounds #Z(Dzg,p) for all curves E/K given by a K-quasiminimal 
Weierstrass equation with at most a fixed number of primes dividing the denomi- 
nator of j(£). It is certainly true, then, that #Z(D¢_p) is bounded as E varies over 
the K-quasiminimal quadratic twists of a fixed curve. It is natural to ask whether, 
in this specific context, one can bound max Z(D¢,p) uniformly. Although a result 
of this sort seems out of reach at the moment, we can prove a strong bound in the 
case in which P is in the image of an isogeny. In particular, if we consider, for a 
sufficiently large prime p, nontorsion points P € pE(K) as E runs over (minimal) 
quadratic twists of some fixed curve, we can show that max Z(D¢_p) is bounded by 
a constant depending only on j(£) and p. 

Explicit results along these lines, over Q and with p = 2, are given in [10], and 
we use similar techniques to obtain the more general results described above. In 
[18], the first author gave sharpened estimates for max Z(Dg,p) when P € E(Q) 
and j(E) € {0, 1728}. Specifically, if we consider minimal E’/Q of the form 


y=x+Ax or y2=x°+A, 


then for a fixed integer k greater than 3 in the first case and 4 in the second, it 
was proven that the set of pairs {(A, P) : k € Z(Dz,p)} is finite and effectively 
computable. Thus any bound on max Z(D¢_p) for a family of elliptic divisibility 
sequences arising from these curves may be, with a finite number of exceptions, 
reduced to a bound of 3 or 4, respectively. Furthermore, this finite set of exceptions is 
effectively (although often not practically) computable. We will extend these results 
to families of twists over number fields. For the remainder of the section we will, for 
simplicity, work with elliptic curves in short Weierstrass form (i.e., with ay = az = 
a3 = 0, in the notation above). We will say that such a model is K-quasiminimal 
if it has minimal discriminant among K-isomorphic curves in the same form. Such 
curves might not be K-quasiminimal in the sense above, but will be away from 
primes dividing 2 or 3. 

To prove the next two theorems, we require tools that trace back, in spirit, to 
the original paper of Ward [35]. Ward considers sequences (hy )nez satisfying the 
relation 

hn t nlm = hin t thm ih, = hy t An ih, (13) 


T m? 


with ho = 0, hy = 1, and ho|h4 (a definition that makes sense over any integral 
domain). It is worth noting that the sequences described in [35] do not correspond 
directly to the sequences discussed here, and we reserve the term elliptic divisibility 
sequence for the latter. For example, if we consider the elliptic divisibility sequence 
Dg.p defined by the point (12, 36) on the elliptic curve y? = x — 36x, we have 


Dp = (1), Dop = (2), Dap = (23), Dap = (140), Dsp = (52487)... 
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It is clear that we cannot choose generators h, € Z of the ideals D,p which satisfy 
the recursion (13), in particular because this would require (setting m = 3 and 
n= 2) 

52487 = |hs| < |hah3| + |h3hi| = 13287. 


Although our sequence Pz _p is not necessarily an elliptic divisibility sequence 
in the sense of Ward, one may associate a Ward-type sequence to it. Recall (from, 
e.g., [27, p. 105]) the division polynomials of an elliptic curve 


E:y?=x?+Ax+B. 
They are elements of the function field K(£) defined by setting 


wo=0, W=1, Ww=2y, ws = 3x4+6Ax? 4+ 12Bx — A’, 
Wa = 4y(x° + 5Ax* + 20Bx° — 5A?x? — 44Bx — 8B? — A?), 


and then pe the recursion (13) to define the other y,. It is an easy exercise to 
show that 2? may be written as a polynomial in x for all n. More precisely, Wn 
(espectively W,/y) may be written as a polynomial in x if n is odd (respectively 
even). We will abuse notation by writing either y,(P) or W,(xp) when wy, € K(x). 

If P € E(K), then (W%(P))nez is a sequence (in K) satisfying (13), by 
construction. While we would like to consider sequences in R, rather than K, which 
relate to Dp, it is difficult to do this if R is not a principal ideal domain. Note that 
if So is a finite set of primes of R, it is always possible to extend So to a finite set S 
such that the localization Rs of R at S is a principal ideal domain. If the Rs-ideal 
Dp Rs is principal, and is generated by ¢, then a simple induction shows that the 
sequence 


hy = 1" —!W(P) (14) 


is a sequence in Rg satisfying (13). This sequence depends on the choice of f, but 
only up to multiplication by S-units. The following proposition, due essentially to 
Ward [35] and Ayad [1,2], indicates how this sequence relates to Dz p. 


Proposition 4. Let E be an elliptic curve as above, let P € E(K) be a point of 
infinite order, let S be a finite set of primes such that Rs is a PID, and let p € 
Spec(Rs) be a nonzero prime ideal. 


(a) Foralln, 
ord, (h,) = ordy(Dnp/Dp). 


(b) If ord, (hn) > ordy(Dnp/Dp) for any n, then p is a prime of bad reduction for 
E, and P has singular reduction modulo p. 


Proof. To prove (a), we note that if t € Rs is a generator for Dp Rs, and we set 


kn, = ae (xpWn(P) = Wnti(P)Wn- \(P)) € Rs, 
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for each n, then x,p = k,/(th,)* (see [27, p. 105]). As k, and h, are S-integral, 
and 


1 
ord, (Drp) = 5 max{0, — ord, (Xnp)} 


it follows at once that ord,(D,p) < ord, (th,,), and part (a) follows. Part (b) follows 
from Théoréme A of [1]. 


We now turn our attention to families of quadratic twists of a fixed elliptic curve 
over K. Let A, B € R be non-zero, and let 


E:y?=x3+Ax+B 
be a K-quasiminimal elliptic curve. Another elliptic curve 
EB’: y?=3x°+A'x +B! 
with A’, B’ € R, is a quadratic twist of E if there is a K-isomorphism from E to E’. 
Writing down the possible isomorphisms between curves of this form [27, p. 49], 


we see that we must have A’ = t*A and B’ = 7°B, for some t € K. Furthermore, 


> AB’ 


v= Ee K. 
A'B 


Note that if S’) C Spec(R) is a set of nonzero primes containing all primes dividing 
AB(4A? + 27B?), and such that Rs is a principal ideal domain, then E’ can be 
K-quasiminimal only if t7 € Rs, and ord,(t*) < 1 for all p ¢ S. If S is sucha set 
of primes, if P € E(K), and if h,, is defined as in (14), then 


ord, (h,) = ordy(Dnp/Dp) 


for all p ¢ S and all n, by Proposition 4. This is, however, not the case for sequences 
(An)n>z and Dg p corresponding to points on twists of E, since E’ may have bad 
reduction at primes outside of S'. The following proposition allows us to restrict 
the amount by which the orders of these quantities vary at primes of bad reduction 
for E’. 


Proposition 5. Let E be as above, and let S be a finite set of primes containing all 
divisors of AB(4A? + 27B?) and all ramified primes, such that Rs is a principal 
ideal domain. Let E' be a K-quasiminimal twist of E, and let P € E'(K) be a 
point of infinite order. If P has bad reduction at p ¢ S, then 


nl i 
oyna 4 2, ue tn 
“= + ordy(h2) + ordy(n) if 2 | n. 
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Proof. Lett € K be as above, so that 2 € Rs 
E’: y? =x94 Ax 4+ 0°. 
The isomorphism ¢ : E — E’ is given by 
wlx.y) = (xe? yr). 
As E' has bad reduction at p ¢ S, and as Disc(E’) = t!* Disc(E), we have 
ord,(t?) = 1. Note that P can have singular reduction at p only if ord, (xp) > 1. If 
ord,(xp) > 1, then we have ord, (x} +1*Axp) > 3, and so 
2ordy(yp) = ord, (yp) = ord, (xp +t*Axp +1°B) = ord, (t°B) =3 

(recalling that ord,(B) = 0 for p ¢ S). This is impossible, as ord,(yp) € Z, and 
hence we must have ordy (xp) = 1. 

Let L = K(r), let p = q? in the ring of integers of L, and let O = @ '(P) € 
E(L). While Q is not K-rational, it should be noted that xg = xp/t* € K. As 


ord, (xp) = 1, we have ord,(xg) = 0, and in particular q + Dg. From this, and the 
fact that F has good reduction at q, we see from Proposition 4 that 


ord, (Dino) = ord, (Wen (Q)) 


for all n (where We, is the nth division polynomial for £). In particular, 


ord,(D20) = ordg(vo) 
= ordg(yp) — 3 ord,(t). 


We have seen that 
ord,(yp) = 2ordy(yp) = ord, (xp + c*Axp + 1°B) > 3, 


and so (as ord, (yp) is even) 
ordg(D29) > 0. 


Because p ¢ S, we must have ey = 1, and thus e, = 2 (in the extension L/Q). It 
follows from Proposition 1(b) that 


if2+n 
d n = ord Dn = 
ordy(Wen(Q)) = ordg(Dng) ord,(Do9) + ordg(n) if 2 | n. 


An examination of the division polynomials shows that 


Wer n(P) = 0" We n(Q), 
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and so (recalling that ord,(Dp) = 0), 


ordg (An) = ordg(Wern(P)) 
= (n? = 1) ord, (tT) + ord, (Wen(Q)). 


For n odd, the proposition follows immediately, since ordg(t) = 1, and p = q’. For 
n even, note that 


ord,(D29) = ordg(yp) — 3 ord,(t) = ordg (hz) — 3, 


and so 
ordg(hn) = (n? — 1) + ordg(h2) — 3. 


Again, we are done as p = q’. 


Theorem 8. Let E be as above, and fix an integer k > 3. Then there are only 
finitely many pairs (P, E’) such that E’ is a K-quasiminimal twist of E, P € E'(K) 
is a point of infinite order, andk € Z(De’p). 


Proof. Let S be a set of primes of R which contains all prime divisors of 
kAB(4A? + 27B?), all ramified primes, and such that Rs is a PID. 
By Mobius inversion we write the division polynomials of FE as 


Wen = I] Ven 


d\n 


for some functions yw; ,, € K(£), and the same for £’. Note that for n > 3, Vern € 
K(x). For functions f(x) € K(x), we will write 


f(x,y) = y®@P) f(x/y). 


Now suppose that P € E’'(K) is as in the statement of the theorem, so that 
Dxp has no primitive divisors. As Rg is a principal ideal domain, we will select 
s,t € Rg with (t) = Dp ands = xp?*. Our first observation, from the definition 
of the division polynomials, is that 


Wen (s, -) = Wen (s, Po) 


for all n (recall that tT € Rs is square-free in Rs). Again, to simplify notation, we 
will set hy = We’ ,(s,t?). - 

Let p ¢ S bea prime of R, and consider ordy (WF , (8, t?t7)). There are several 
cases. 


Case 1. P has good reduction at p and ry = 1: In this case, by Proposition 4, we 
see that 
ordy(Dnp/Dp) = ord, (hn) 
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for all n. Note that rp = 1 implies p | f, and hence p + s. It is an easy exercise [27, 
p. 105] to show that 


2 
ie aX) = m?x" | 4 lower order terms 


for each m. Recalling that S contains all divisors of k, we have ord, (d 25d°—1) =0 
for any d | k, and so ord, (hz) = 0 for any d | k. It follows at once that 


ordy (Wf (s,t?)) = 0. 


Case 2. P has good reduction at p andr, > 1: In this case ordy(Dp) = 0, and so 
Proposition 4 gives us 
ordy (Dip) = ord, (hn) 


for all n. Furthermore, p is not a primitive divisor of Dyp, and so either p + Dxp, or 
else rp is a proper divisor of k. In the former case it is clear that ord, (Winn (s, t?)) 
= 0, and so we will suppose that we are in the latter case. We have 


ordy(D;,mp) = ordy(D,, p) + ord, (m) 


for all m, and so if jz is the Mobius function, then 


ord, (WF, ; (Ss, r\jS 2 (da) ordy (hx ) 
dk 


dH A) ordp(D p) 


d|k 


> 1 (d) ordy(D« p), 


d\(k/rp) 


II 


as ordy(Dyp) = Oif rp + m. Writing k = k’ry, we obtain 


ord, (We, ; (s,t7)) = > (da) ordy(D iyi’ ,) 
d|k’ e 


= > yu (d) (ord, (D,,, p) + ord, (k'/d)) 
d|k! 
— 0, 


as ordy(k) = 0, and as Dan ju (d) = 0 for any m > 2. 


Case 3. P has bad reduction at p: In this case, Proposition 5 ensures that 
ordy(t) = 0, ordy(s) = 1 = ordy(t). Also, we have 
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nl 7 

f2 
ordy (An) = re : tn 
“= + ordp(i2) + ordy(n) if 2|n. 


If k is odd, we have 
ord. (er(6,0)) = > 1 (d) ord, (hs ) 
d|k 


= > w(d) deg(Wer c/a) = deg(We, ,)- 


d|k 


If k is even, we have (recall that ord, (k) = 0) 


ord, (Wir n(8.0°)) = Y) Hd) ordp (hs) 


d|k 


3 
= Dowd) deg(verxja)+ DY wd) (ord, (a . >) 
d|k d|k 
k/d even 


= deg(Wz ,)- 


The second term in the penultimate line vanishes as )> d\(k/2) u(d) = 0. 
We now have a value for ord, (Wi, , (5, t7)) in each case. To summarize, if we 
choose a generator (g) = sRs + t7?Rs, we have 


ord, (Wy, (s,0°)) = deg(W#, ,) ordp(g) 


for each p ¢ S. In particular, 


7 —de a ree 
VEn(S/g.007/g) = g SVE HE, (3,07) (15) 
is an S-unit. But s/g and t?t”/g are S-integers, and 


Ween (X) = Wen (X, 1) 


has at least three distinct roots forn > 3. Thus (15) defines a Thue—Mahler equation, 
which has only finitely many solutions. Each solutions traces back to a unique pair 
(E’, +P), proving the result. 


We now turn our attention to the claim that, for a sufficiently large prime p, 
max Z(Dz.p) may be bounded for points P € pE’(K) as E’ varies through a 
family of quadratic twists. Translating the problem back to F, this will require us to 
obtain a lower bound on sizes of the ‘denominators’ Dg of points Q € E(K) such 
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that [K(Q) : K] < 2. As mentioned above, Siegel’s theorem allows us to conclude 
that 


(1—«)h(Q) < log Nx(g)/aDo + O(), 


1 
[k(Q):Q 
but the implied constant depends on the particular field K(Q). This is insufficient 
for our needs. It turns out, though, that if p is large enough, we can obtain more 
uniform estimate for points Q = pQ’ with Q' € E(K) such that xg € K. 


Proposition 6. Let E be as above, fix a rational prime p = 3, and let 6 > 0. Let 
QO € E(K) be such that xg € K. Then 


: 5)\ » 
jet) ipa) < 
P 


where the implied constant depends only on E, K, 4, and p. 


1 


Proof. We begin by noting that there is a map f : P! > P! of degree p? such that 
xpo = (xo) 


for all O € E(K). Explicitly, in terms of the division polynomials, we may write 


_ Wnt1Wn-1 
9 


n 


f=x 


This is, a priori, an element of K(£), but is easily shown to lie in K(x). 
Writing ||x||, = [a | Xv for all v € Mx, and letting S denote the set of 
archimedean places of K, we have 


1 1 1 
Kyo 2 Nx(o)/eP po = iK:0) » 5 max {0, log ||xpa llv} 
1 1 
= xtra) KO] > max {0, log ||xpa |v}. 


ves 


We have (by the basic properties of heights; see Theorems 5.6 and 9.3(e) of [27]) 


h(xpo) = h(f(xo)) = p?h(xo) = 2p?h(Q) + O(1), 


where the implied constant depends on EF and p. 

We will now apply a version of Roth’s Theorem to the poles of f(x). Specifically, 
by Theorem D.9.3 of [17] applied to ; € K(x), we find that there is a constant 
c > Osuch that 


a min 0, log 


veS 


FG >= —[K : QH#S(2 + €)h(x) — loge 


y 
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for all x € K. In particular, we have 


—1 


2K : Q] > max {0, log | f(xo)|lvt = #82 + HA(Q) — Cc’, 


ves 


for some constant c’ depending only on E, p, and K. Combining these estimates, 
and noting that #S < [K : Q], we see that 


1 1 
xh(xpo) = KO] S| max {0, log |lxpally} 


ves 


> (pP?-[K:Q2+6)h(Q)-c", 


for some c”. This proves the proposition, as h(p Q)= p?h(Q). 


Ultimately, under the assumption that the nth term of Dz’ p has no primitive 
divisor, we wish to apply Proposition 2 to derive an upper bound on D,,p. The 
proposition provides no such bound, though, if m = sy for some exceptional 
prime p. In the proof of Theorem 4 we overcame this by employing the observation 
that the number of exceptional primes is bounded in terms of [K : Q]. The next 
proposition shows that, once attention is restricted to a family of quadratic twists, 
max{Sp : p is exceptional} may be similarly constrained. 


Proposition 7. Let E' be an elliptic curve over K, let P € E'(K), and let p be an 
exceptional prime for Dg p. Then Sy < M, for some quantity M depending only 
on K and j(E"). 


Proof. Let p be an exceptional prime for Dg p, and let p be the characteristic of 
R/p. Let 
E;(K) = {Q € E'(K) : —ordy(xg) > 2i}, 


as in Proposition 1, and note that sy is simply the order of P in E’(K)/E},(K), for 
N the least integer greater than ey /(p — 1). Note, as per [27, Ch. IV] and the proof 
of Proposition 1, that E{(Ky) is isomorphic to a formal group, and that if z is the 
coordinate corresponding to Q € E}(K), then ord,(z) = ordy(Dg). Furthermore, 
the multiplication-by-p map in the formal group is given by a power series of the 
form 

[plz = pz + O(2’). 


Now suppose that Q € E}(K) is not trivial modulo £4, (K), ie., that ord)(Dg) < 
€p/(p — 1). Then 


ordy(Dpo) = ord, ([p]z) = ordy(z) + ordy(p + O(z)) = 2 ord, (z), 


since 


e 
ordy(z) < oi < ey = ordy(p). 
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By induction, the order of Q in E|(K)/Ej(K) is at most p”, where n is the least 
integer greater than log, N. This bounds the order of any element of EF} (K)/E(K) 
in terms of éy and p. 

It now suffices to bound ry, which is the order of P in E’(K)/E{(K). If E’ has 
good reduction at p, then 


E'(K)/E\(K) & E(R/p), 


where E is the reduction of E’ modulo p. Thus, by Hasse’s theorem the order of P in 
E'(K)/E}(K) is at most (/#(R/p)+ 1). If E’ has bad reduction at p, let E}(K) © 
E’(K) be the subgroup of points with nonsingular reduction modulo p. Hasse’s 
theorem again bounds the size of Ej (K)/E}(K), while the size of E’(K)/E((K) 
is at most max{4, — ord, (j(E’))}, by a theorem of Kodaira and Néron [27, Theorem 
6.1]. 

Thus, for each p, we have sy < M, for some quantity M, depending only on p 
and j(E’) (in fact, we have not yet needed the fact that p is exceptional). As noted in 
Remark 3, p can be exceptional only if p < [K : Q]+ 1. Considering the maximum 
of the M, over the finitely many primes p with p < [K : Q] + 1, we have our 
bound M. 


Theorem 9. Let E be a K-quasiminimal elliptic curve, let € > 0, and let p = 
VI[K : Q\(4 + €) be a rational prime. Then for each K-quasiminimal twist E' of E 
and each nontorsion point P € pE'(K), max Z(Dg’.p) < C for some constant C 
which depends on E, K, and p, but not on E' or P. 


Proof. Suppose that D,,p has no primitive divisor. By Proposition 7 there is a 
quantity M, depending only on K and j(£’) = j(E), such that sy < M for all 
exceptional primes p. As our goal is to provide a bound on m that depends only on 
E and K, we will assume that m > M. By Proposition 2, then we see that 


log |Nx/gDmp| < > log Nx/oDxep + d log(2d) + d log(m), 


k|m with 
m/k prime 


where d = [K : Q]. Note that 


d 
sz logNx/oDkp < am a her) 
k|m with k|m with 
m/k prime m/k prime 
< So (dhkP) + O(A(E)) 
k|m with 
m/k prime 


< dh(mP))~ = + O(log(m)h(E’)). 


q|m 
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As before, we have an isomorphism E — E' by (x, y) © (xt?, yt?) for some 
t € K with t? € K. Taking d to be fixed, and noting that h(E’) < 1+ h(t), we 
have (if m € Z(Degp)), 


log|INxjoDmp| < do(m)h(m P) + O(log(m)(1 + h(x))), 


where p(m) = Yoain 4 
On the other hand, we have Xmg = Xmp/ tT”, and so 


1 
SAN. SL log Nx ) oDin = log Nx oDmp a dh(t). 
[K(Q): K] * M"enne 
Finally, if P € pE(K), we may write P = pP’, for some P’ € E(K), and set 
QO’ = ¢ '(P’). It follows from Proposition 6 (applied with Q = Q’ and 6 = €/2) 
that 


d(2+€/2) 
(1- P 


) afc) _ (1 = ao 


) aivmg) 


1 
= a oe toe N Dmg + O(1) 
[K(Q) : K] K(Q)/Q’mQ 
log Nx/joDmp + O(A(t)) 


< dp(m)h(m? P) + O(log(m)h(z)). (16) 


IA 


By Theorem 6, there is a constant 5 > 0 depending only on j(£) and K such 
that h(P) > 5(1 + A(t)) (on the assumption that P is not a point of finite order). 
Dividing both sides of (16) by m*h(P) then yields 


d(2+ €/2) 


<a) (17) 


< pim) +0 (M8 


where the implied constant depends only on F, K, and p. But 


132 2’ 


1 1 I a | 
2 
p(m) < a q Sg a Oe eG 
q prime n=17 
and so (17) bounds m, since our condition on p ensures that 


d(2+<¢/2) 1 
po 2 


This proves the result. 
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Finally, we should note that the restriction in this section to curves in short 
Weierstrass form is not, strictly speaking, necessary. Any elliptic curve over K 
may be written in short Weierstrass form with a change of variables. If Dz,p is 
a given elliptic divisibility sequence, we may choose this transformation such that 
the values of D,p are unchanged at primes not dividing 6. By enlarging the set S 
in the statement of Theorem 8, then we can treat all K-quasiminimal models of 
elliptic curves in a given family of quadratic twists (as long as the finiteness in the 
conclusion of the theorem is now interpreted as finiteness up to this sort of change of 
variables). Similarly, Theorem 9 may be made independent of finitely many primes 
(for example, those above 2 and 3) by increasing the set S appearing in the proof 
of Proposition 6 to include these primes. In this case, however, we must require that 
p > J#S(4+ €) (where S' contains at least all infinite primes). 


1.5 Speculative results over Q 


As we have seen in Theorem 7, there is a uniform bound on the size of the 
set Z(Dze,p) if one is prepared to accept the conjecture of Szpiro. It seems, not 
surprisingly, rather more difficult to establish a uniform bound on the largest element 
in the set Z(Dg,p). However, if one restricts attention to certain families of elliptic 
curves, then a bound may be obtained under similar assumptions. For simplicity, we 
work over Q. 


Conjecture 4. (Hall [15]) For every « > O there exists a constant C. such that 
whenever x, y, and M + 0 are integers satisfying 


y=axtM, 
then there is a bound of the form |x| < C.M?T*. 


Theorem 10. Suppose Hall’s Conjecture 4 holds. Then there is a finite set E such 
that if M is a sixth-power free integer, E : y> = x* + M, and P € E(Q) is 
nontorsion, then (M, P) € € or 


Z(Dze,p) © {1, 2,3, 4}. 


Proof. In light of Theorem 3 of [18] (the analogue of Theorem 8 in this paper), it 
suffices to produce a uniform bound on max Z(Dz_p) for all E of the above form. 
Suppose Q € E(Q) is a point of infinite order, and let 


Ag Bo 
Q= (42. 32 
Q Q 
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as usual. We have BS = Ao + MD¢,, and thus, for a fixed € > 0, the conjecture of 


Hall tells us that 
|Ag| < C.(MD9Q)"**. 
It follows that 
hy(Q) < 6(2 + €)log|Do| + O(log |M)), 


where the implied constant depends on €. So we have, for any Q € E(Q) of infinite 
order, 


lo 
log|Do| = 3070) 2) + O(log|M)). (18) 


Now let P € E(Q). We have by Propositions 2 and 3 that n € Z(Dz,p) only if 
log |Dne| < p(nyn?h(P) + O(og(n) log |M|). (19) 
with explicit constants. Consider (18) with Q = nP. Noting that 
log|M| < O(h(P)), 


we have 


1 2 2 
———n’ < Ol , 
ao <Plnyn® + Ollogin)) 
If we take € < 1, we see that 1 is bounded by some WN as long as p(n) < 0.1, say. 
The latter condition is ensured if (n, 6) = 1. Appealing to Theorem 6, our bound is 


max{N, max(Z(D_g,p) N 2Z), max(Z(Dzg,.p) N 3Z)}. 


Remark 5. Note that, much as in Theorem 7, we only really need the weaker 
assumption that Hall’s Conjecture holds for sufficiently large ¢. If (18) holds for 
any value of ¢€, then (19) yields an upper bound on the values n € Z(Dze,p) such 
that p(n) < as for example. But it is easy to check that if p(”) > oe! then 
n has a prime divisor p < 6(2 + €). Computing a bound as in Theorem 6 for each 
such p, we obtain a uniform bound on max Z(D zp) for j(E) = 0. 


Extending this idea, one can prove a general result if one accepts two stronger 
conjectures, both due to Lang. The first was already stated over number fields 
(Conjecture 2), but we restate it here over Q for the convenience of the reader, while 
the other is a genarlization of Conjecture 4. 


Conjecture 5. (Lang [19]) There is an absolute constant C > 0 such that for every 
minimal F'/Q and every nontorsion P € E(Q), 


h(P) > Ch(E). 
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Conjecture 6. (Hall—Lang [20]) There exist absolute constants C; and C) such that 
if x, y, A, B are integers with 4A? + 27B? # 0 and 


y2 =x + Ax+B, 


then |x| < C; max{| A], |B]}@. 


x 


Theorem 11. Suppose that Conjectures 5 and 6 hold. Then there exist absolute 
constants M, and M) such that for all E/Q and P € E(Q), ifn € Z(D_g.p) then 
eithern < M, or there is a prime p < M) such that p|n. 


The proof of this theorem is nearly identical to the proof of Theorem 10. 


Remark 6. If one restricts attention to a given family of quadratic twists, then 
Conjecture 5 is known to hold. Thus, if one assumes that Conjecture 6 holds, at least 
for the given family of twists, one may apply Theorems 8 and 6 to obtain a statement 
analogous to Theorem 10. That is, one deduces that except for a finite number of 
exceptions, elliptic divisibility sequences Dzp arising from this family of twists 
satisfy Z(Dz,.p) C {1,2}. Note also that, under the assumption of Conjecture 6 
for all elliptic curves, a uniform version of Theorem 6 would provide a uniform 
bound on Z(Dz¢,p) for curves E/Q. Such a uniform statement, however, requires a 
refinement of Roth’s theorem that is far beyond current Diophantine analysis. 
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The heat kernel, theta inversion 
and zetas on [\G/K 


Jay Jorgenson and Serge Lang 


Abstract Direct and precise connections between zeta functions with functional 
equations and theta functions with inversion formulas can be made using various 
integral transforms, namely Laplace, Gauss, and Mellin transforms as well as their 
inversions. In this article, we will describe how one can initiate the process of 
constructing geometrically defined zeta functions by beginning inversion formulas 
which come from heat kernels. We state conjectured spectral expansions for the 
heat kernel, based on the so-called heat Eisenstein series defined in [JoL 04]. We 
speculate further, in vague terms, the goal of constructing a type of ladder of zeta 
functions and describe similar features from elsewhere in mathematics. 


Key words Zeta function ¢ heat kernel * spectral expansion 
Mathematics Subject Classification (2010): 35K08, 11M36, 11F72 


The survey article “Riemann’s zeta function and beyond” [GeM 03] and the book 
Introduction to the Langlands Program [BeG 04] describe one direction for the 
theory of automorphic forms. Its successes and ramifications for three decades 
have deservedly attracted much attention. We would like to describe here a way of 
looking at some aspects of analysis on certain spaces of type [\\G/ K (which will be 
defined below) pointing to a different direction with different emphasis. We do not 
question that both directions interact with each other, but perhaps most effectively 
after this second direction achieves certain successes of its own. However, this other 
direction that we are presenting requires a reworking of the foundations of the sub- 
ject, and explaining from scratch how it evolves inductively from the simplest case 
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where K» is the unitary group SU(2). Actually, even this case is already one level 
above the most classical case of 277 Z\R, i.e., the case of the circle with which we 
start our exposition. 

The heat kernel is present as the ubiquitous and fundamental object [JoL 01b]. 
As will be seen, one may summarize its manifestation in this survey by saying that 
the ¢ in the heat kernel K, is the same f¢ as in the Poisson theta inversion formula 


t'/76,(1/t) = 0(t). 


We shall describe a way to zeta objects through theta objects, starting with the 
most classical case on the circle as above. The procedure takes five steps: 


¢ Start with the heat kernel on G/K. 

e Periodize with respect to T’. 

¢ Determine the eigenfunction expansion on [\G/K. 
¢ Regularize the expansion and integrate over ['\\G. 

e Apply the Gauss transform. 


This yields what amounts to the logarithmic derivative, with fudge terms. It may 
be viewed as preparing the ground for seeing how the functions so obtained fit into 
ladders (cf. Section 6), whenever one has a sequence of suitable embeddings 


a Gi/ Kn = Gn4i/Kn+1 Sees 


The most important case for us will be with G, = SLy,, or certain subgroups 
isomorphic to products of certain SL», (m < 1) called parabolics. We shall proceed 
stepwise. We concentrate on what seems a special case with the groups SL,; 
however, we will describe a general theory whereby SL,,(C) becomes a controlling 
group for semisimple Lie groups; see Section 6. Other classical groups also play 
such a controlling role, such as SO(p, g) and Sp(7). 
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1. The circle and Riemann’s functional equation! 


The Riemann zeta function €g(s) can be defined in two ways: 


f(s) = p> n * or via the Euler product [a = ey: 


'The present article was completed by Jorgenson and Lang during the summer of 2005 shortly 
before Lang passed away on September 12, 2005. As such, this article is the last mathematics 
paper written by Lang. At the time the article was written, it was the intention to describe the 
future direction that Jorgenson and Lang envisioned for their research. 
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The first way reflects analytical properties, and the second exhibits a number- 
theoretic connection with the primes. The analytic connection, which will become 
relevant in Section 2, is that n can be viewed as the square root of an eigenvalue 
n? for the positive Laplacian on the functions e!”*. We return to this later. Here, we 
want to indicate the way to basic analytic properties of the zeta function. 

In general, we define a theta series to be a series of the form 


O(t) = > ane", 


where {a,} is a sequence of complex numbers, and {A,,} is a sequence of real 
numbers tending to infinity. In practice, these 1,, are > 0. They increase sufficiently 
fast so that the series converges absolutely. The notation suggests that often the 1, 
will be related to eigenvalues. We consider the universal covering 


R= R/2nZ. 


Here [ = 27Z is the discrete group of translations by integral multiples of 27. 
Given a function f on R, its I'-periodization is the function defined by 


f(x) = YF fe + 2a), 


neZ 


What do we do with a periodic function? Advanced calculus says we expand it in a 
Fourier series. Are there best possible functions to do this with? The answer depends 
on what we want to do with them. For some purposes, one wants to deal with non- 
smooth functions, e.g., the sawtooth function whose Fourier series is Dei /n. For 
our purposes, we want a priori better behavior than that. There is essentially a unique 
class of functions that work out most easily while having extensive applications, 
e.g., to zeta functions, namely the gaussians g(x) = e-* /¢, with some positive 
constant c, and linear combinations of these, or simply constant factors times these 
functions. The constant factor is normalized to get the probabilistic condition that 
the total integral is 1, i-e., for an arbitrary positive function f we let 


If) = | f@)dx, 
H 


so f/I(f) is probabilistic. This still leaves a choice for the positive constant c. As 
far as we are concerned, there is a best way to express this constant, namely c = 4f, 
in which case /(f) = 4m, and thus we let 


1 _» 
2G) = weevil es (1) 


In any case, whether we put c = 4f¢ or not, let We = @/1(g-). Then {We} is a 
Weierstrass—Dirac family, or {g,} is a Weierstrass—Dirac family, which we define 
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as follows. Let {g;};+0 be a family of continuous functions on R. We call {g;} a 
Weierstrass—Dirac family if it satisfies the following conditions: 


WD1. ¢, 20. 
WD 2. f g:(x)dx = 1. 
R 


WD 3. Givend>0,wehavelim f g,(x)dx = 0. 


10 a(%.0)25 


Here, d(x, 0) is the distance between x and 0. But with this notation, generalizations 
can be immediately formulated, e.g., to euclidean space and onward. 

A Dirac family is also called an approximation of the identity because of the 
following approximation theorem, whose first manifestation is in [Wei 1885]. 


Let f be a measurable function on R. Let {g;} be a Weierstrass—Dirac family. 
Define the convolution 


eas / ely) fe — yay. 


R 
Then g; * f — f uniformly on every set where f is uniformly continuous. 


In particular, we get pointwise convergence where f is continuous. 

Now a word about using the parameter ¢ instead of c. The floating constant 
is uniquely determined by the Dirac family condition and a differential equation, 
called the heat equation, namely if @, = (0/dx)*, then 


(—@, + 0;)g+(x) = 0. 


This is the structural explanation for formula (1) defining the heat gaussian. We then 
take the following steps: 


e Start with the heat gaussian g,. 
* Periodize it, to get g’. 
e Expand the periodization in a Fourier series. 


A calculus computation shows that the Fourier series is what occurs on the right of 
the following relation 


1 és 2 1 als. Ga ke 
> e (x+27n)*/4t se en t pinx 


. Ant ne€Z 21 ne€Z 


Observe that the left side is an inverted theta series times the power t~2. The right 
side is a theta series. Both have variable coefficients depending on x. We then 
perform a fourth step: 


e Put x = 0, yielding a theta inversion relation 


1 2 1 2 
—(2mn)*/4t _ —n*t 
) == 6 = ) e€ F 
V/4rt 20 


n€Z ne€Z 
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also called the Poisson inversion formula. (Note: Because of the special abelian 
nature of R, one gets away with a misleading technical trick to eliminate the variable 
x by setting x = 0.] 

Having such an inversion formula, what does one do with it? Riemann applied 
the Mellin transform, defined by 


may(s) = [ a(n 
0 


Then easy calculus shows that one gets the zeta function with some fudge factors. 
The inversion relation (with ¢,1/t on the two sides of the relation) becomes the 
functional equation interchanging s and | — s: 


Af(s) = AC(1 —s) defining AC(s) = 2~*/7T (s/2)LQ(s), 


with fudge factors 2~*/?, T'(s/2) (I boldfaced to distinguish from a discrete 
subgroup). 
We work with another transform, which we call the Gauss transform, defined by 


Gauss(8)(s) = 2s f eo O(t)dt. 


0 


Never mind the fact that the integrals don’t converge for the moment. In the 
monograph [JoL 94] (reproduced in [LanJo 01] Vol. V) we explain how to regularize 
this integral. Formally the integral is easily evaluated on functions of the form 
e~”*. The Gauss transform of this term for n # 0 is 1/(s* +n”), and Gauss(@) 
is essentially the logarithmic derivative of sine. Thus the Gauss transform has an 
additive structure (Mittag-Leffler type) in contrast with the multiplicative structure 
for the ordinary zeta function. The functional equation of Gauss(@)(s) is given by 
the obvious symmetry coming from the invariance of the integral under s +> —s. 


2 The next case: SL2(C), eigenfunction expansion 


We have a choice: SL2(R) or SL2(C). From the point of view of freshman calculus, 
SL,(R) is “easier”. From any more ambitious point of view, SL2(C) is easier, for 
the following main reason. We want gaussians analogous to those on R. For SL2(C), 
the gaussians are “split” in a way that they look like those on R. For SL2(R), one 
needs an extra integral which at first looks disagreeable, but which can be explained 
structurally. We now go into such structures. 

We start with the basic question: what is the analogue of Fourier expansion? 
Here is where the eigenfunction property of e'”* (or e” for the theory of Fourier 
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transform on R itself) becomes central. One does not know in general how to 
write down a priori elementary functions by a formula that will play the role of 
these exponentials. But thanks to insights of Bargman, Harish-Chandra, Selberg, 
Maass, Roelcke and Langlands, one does know that such functions exist as eigen- 
functions of a laplacian. We shall now describe how one gets to the eigenfunction 
expansion. 

From complex analysis, everybody knows that the group G = SL2(R) acts by 
fractional linear transformations on the upper half-plane hy. The subgroup leaving 
i fixed is the unitary subgroup K (which is maximal compact). Thus we have an 
isomorphism of homogeneous spaces 


G/K = hy sending z-> g(z) = (az+ b)(cz +d) 


ifg = (: ) € SL2(R). The action on the coset space G/K is just translation. 
c 


The non-commutativity is analyzed via a product decomposition of G. Let 


U = subgroup of upper triangular matrices u(x) = (( with x € R. 


A = subgroup of diagonal matrices a = diag(a1, a2) with diagonal elements 
a,,a2 > 0. 
K =real unitary subgroup K(R). 


Then the product map 
UxAxK— UAK=G 


is a bijection (real analytic isomorphism). Each element g € G has a unique 
decomposition g = uak, called the Iwasawa decomposition, with u € U,a € 
A,k eK. 

The group A is isomorphic to the positive reals R*. For ulterior purposes, we 
write the isomorphism with notation fitting the additive version on the R-space of 
real diagonal matrices H = diag(h,, h2) with h;, hy € R. We let a be the character 
defined by w(H) = h,—hp2. Thena is an isomorphism of the diagonal space with R. 
Its exponential is the isomorphism we want of A with R*. The map x. : A —> Rt 
sending 

dtey == er(losa) — a,/az = a; 


is a group isomorphism of A with Rt. We call y the Iwasawa coordinate on A. 

The two subgroups U and A are used to provide eigenfunction expansions on 
G/K, thus reducing such expansions to abelian groups. 

We shall state the eigenfunction expansion formula, but first we comment on 
gaussians, for which we want to carry out the explicit steps of Section 1. Being on 
R causes all the possible formulas to be twisted in a somewhat disagreeable manner. 
Going to SL2(C) makes life much easier, and we shall explain below the method 
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for recovering SL2(R) afterwards. Thus instead of the upper half-plane, we use the 
upper half 3-space hs, defined as the set of quaternions 


Z=xtjy=x+int+jy 


with x € C, y > 0 and k-coordinate equal to 0. We let G = SL2(C). Then G acts 
on hz by 
ze> (azt+bj(ez+d)". 


Brute force shows that the image of the fractional transformation does indeed lie 
in hg (1.e., has k-coordinate 0). The isotropy group of j is K(C), the usual unitary 
group over C. When dealing both with R and C simultaneously, we have to keep 
them in the notation, but working with one case for a continued period, we omit this 
extra notation. We now suppose that we are over C, unless otherwise specified. 

We have the three groups: 


U = U(C) consisting of elements u(x) with x complex, so U = C. 
A = same group as for SL2(R)! 
K = unitary group. 


The product map U x A x K —> UAKis a bijection with G = SL,(C), again 
called the Iwasawa decomposition. We have the same character a as before on A, 
with coordinate y. We call (x, y) the Iwasawa coordinates. 

We can now define gaussians. The group G also admits a polar decomposition, 
G = KAK,g = kibkz with kj,k. € K and b € A, uniquely determined up to a 
permutation of the diagonal elements. We define the polar height o by 


a(g) = (log ys)’. 


Then a gaussian on G/K is a positive multiple of a function having the form 


- —o7(g)/2c log y = —y*/2¢ v 
Gc(g) =e (y — y—!)/2 sinh v 
— e707 /2¢ o ; 
sinh o 


with c > O with additive variable v = log y, and y = yz is the polar y. The factor 
on the right is a Jacobian factor but we do not need to go into this. 

We use the Iwasawa Haar measure given by d(z) = dx;dx2dy/y>. Then we 
can define the total integral /(f) for a function f on G/K = hz, and also the notion 
of Dirac family, using the distance function on G/K, 


d(z,w) = o(z 'w) 
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which is G-invariant. We can then define Ww = @/I(g-) as we did on R. Then 
{w-} is a Dirac family, using the same definition as on R, except that for the third 
condition WD 3, one takes the distance d(x, e) between x and the unit element of 
G, or the unit coset eK, because the distance is K-bi-invariant. The approximation 
theorem is valid. Furthermore, the vector space generated by gaussians is dense 
in anything one wants, see [JoL 03b]. This space can be used to develop the 
general spherical inversion theory by explicit formulas [JoL 03a]. Most importantly, 
it contains the heat kernel discussed below. 

As to the laplacian, it is easier here to use the group theory to define a Casimir 
operator (which would turn out to be a scaling of the laplacian if we used the 
language and context of Riemannian metrics). Casimir is defined entirely in terms 
of a G-conjugation invariant scalar product on the Lie algebra g of G as follows. 
The Lie algebra is just the vector space of matrices having trace 0, with the bracket 
product [X, Y] = XY — YX. We use the real trace form for the scalar product of 
matrices Z;, Z2 € g, that is, we put 


B(Z}, Zz) = Re tr(Z,Z). 


Let Z € g. One can define a left-invariant differential operator denoted Z or D(Z) 
(the directional derivative in the direction of Z) by the formula 


~ d 
(Z/)(8) = f(g exp(tZ)). 
t=0 


If {Z;} is a basis for g over R, and {Z;} the dual basis, then Casimir is defined by 


soy Z7, 


which is independent of the choice of basis. Given an invariant Riemannian metric 
on G/ K(= hs), Casimir is equal to a constant times the laplacian with respect to 
this metric, but one can also carry out everything in terms of the group structure as 
we have done above. 

The heat equation for Casimir is then —wh, + 0;h; = 0. There is a unique heat 
gaussian (gaussian satisfying the heat equation and the Dirac properties) given by 


g,(g) = (8xt)3/e%e-7(@/" SF _ 
sinh o 


(2) 


which is in line with the euclidean heat gaussian. The general analysis of which this 
is a special case was done by Gangolli [Gan 68]. 

What we have defined above suffices to formulate the first two steps, the 
periodization being with respect to the group SL2(Z[i]) in the complex case, and 
the group SL2(Z) in the real case. For the remainder of this section, we set 
T = SL,(Z[i)) 
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There remains to explain what the eigenfunction expansion is analogous to the 
Fourier expansion on R (or euclidean space). This comes originally from Roelcke 
[Roe 66] in the context of functional analysis on SL2(R). Because of the more 
complicated structure of the group, the expansion has a series like a Fourier series, 
but it also has a continuous version analogous to the Fourier integral. We give the 
definitions needed to state the expansion. 

A function on [\G/K is T'y-periodic (Ty = [NU * Zf[i]). Hence it has an 
ordinary Fourier expansion on C/Z[i]. We say that a function f is cuspidal if the 
constant term of this Fourier series is 0, that is, 


; flug)du(g) = 0 for all g € G. 
Ty\U 


The cuspidal subspace L2,,.(['\G/K) then has an orthonormal basis {y; } consisting 
of eigenfunctions of Casimir, with negative eigenvalues —A ;. 
On the other hand, to get the continuous part, we use the Eisenstein series, 


depending on a complex parameter s, and defined by 


E@=£6.0= > Gat = > yo", 


Tu\C Ty\C 


where (yz), is the projection on A from the Iwasawa decomposition, so we can 
apply the character 7, to this projection, writing this application exponentially. We 
can write each term as we did in terms of the Iwasawa y-coordinate. The Eisenstein 
series is thus the Cy \I'-periodization of the character y;.. The series is absolutely 
convergent for Re(s) > 2, and has a meromorphic continuation. The character 7 sq 
is an eigenfunction of Casimir with eigenvalue 25(s — 2), and so is the Eisenstein 
series. 
Convolution on G itself is defined by 


if s)@= | few acw)dutw). 
G 


Given a function of two variables F(z, w), the convolution is defined by 
(F * h)(z) = / F(z, w)h(w)du(w). 
G 


We superimpose I’-periodization on this. For a function f on G, we let 


f°) = >> fow). 


yer 
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Then we have the group Fubini theorem 


yer 


[ fordnon = ff o%dun6o = f YL fowdurct. 
G 


r\G r\G 


Lt F be G-invariant (meaning F'(gz, gw) = F(z, w) forall g,z,w € G). Define 
the periodization 


F'@w) => F(yz.w) =) FQ yw). 


yer yer 


If 4 is a function on T'\G, then 


i F(z, w)h(w)du(w) = / F'(z,w)h(w)dur\e(w), 


G r\G 


where jtp\G is the natural homogeneous space measure induced by dy on G, 
coming from the fact that G and \G are locally measure isomorphic. 

For convolution purposes, it is useful to deal with the integral kernel in two 
variables derived from a function in one variable via the group law. Thus we now 
define the heat kernel 

K,(z, w) = g,(z'w). 
The heat kernel is right K -invariant in each variable, so defined on G/ K x G/K. Its 
T’-periodization is given by 


Ki (z,w) = )> Ki yw) = Ki (yz) = > 8, @ yw). 


yer yer yer 


Eigenfunctions of Casimir are also eigenfunctions of convolution by the heat 
kernel. If a f = —Af, then 
K,* f=e"f 


In particular, @ ¥sq = 25(8 —2)Y¥sa, SO 
K+ Eo en "8, 


Convolution can of course be defined using any function of two variables, for 
instance E(s, z), taking the integral with respect to either variable. We have 


ens U Bine= | AWE dune), 


r\G 
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so it is the hermitian scalar product (Eisenstein coefficient up to a constant). Of 
course, all these formulas hold under assumptions of absolute convergence which 
have to be specified or proved. 


Theorem 2.1. Eigenfunction expansion. For f € C2°(P\G/K), one has the 
eigenfunction expansion 


Co 
f= iti taif+tee f Ex f\GEG.2dIm6) 
j=l Re(s)=1 
With the Iwasawa measure, the constants are Cco(f) = rtes.(E * f) and cr = 
1/162. (Notation: res means residue at 2.) 


We have written the convolution FE * f, viewing E as an integral kernel function, 
but of course it has here the interpretation as the scalar product as above. So the 
Eisenstein series forms a continuous family playing the role of the exponential e”” 
on the line. The integral is actually on the imaginary line s = 1+ ir,r € R. 
The eigenfunction expansion is originally due to Roelcke [Roe 66], in the context 
of functional analysis and spectral decomposition, so it is usually called spectral 
expansion. The factor 1/16 = 1/4? cancels a quadruplication due to the units of 
Z{i] in our definition of Eisenstein series. 

To avoid disagreeable convergence problems, the eigenfunction expansion was 
stated for C® functions with compact support, using the space of test functions 
which go as far as possible in eliminating convergence problems. However, in real 
life, interesting functions do not have compact support, for instance the gaussians, 
especially the heat gaussian. However, gaussians have quadratic exponential decay, 
but Eisenstein series have only linear exponential growth, so no convergence 
problem arises for their convolutions. 

We may now state the eigenfunction expansion for the heat kernel. 


Theorem 2.2. With the Iwasawa measure and cy = 1/vol(T\G), 


Ki.) = doe Wj @¥j() +c 


j=l 


[o.e) 
+ CE / e+) B+ ir, 2) E(1 —ir, w)dr. 
—oo 


Note that the exponent —2(r? + 1) is the eigenvalue 2s(s — 2) on the line 1 + ir 
(r € R), which is the imaginary line at the middle of the critical strip 0 < Re(s) < 2. 
Thus the formula is an expansion of the heat kernel in terms of its eigenfunction 
components, the first part being discrete, and the second part being continuously 
dependent on the parameter 7, in a manner similar to Fourier series and Fourier 
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integrals. Putting z = w gives the heat kernel on the diagonal. The periodization on 
the left is then 


K; (2,2) = > Ki(vz.2) = D8, v2). 


yer yer 


This periodization is therefore expressed in terms of conjugation, and we are led to 
investigate conjugacy classes. 

For any group G and g € G, we let e(g) be conjugation by g. Let T be a 
subgroup of G and I’ a subset of I’ which is e(T)-invariant. Then [’’ is the union of 
T’-conjugacy classes. We denote the set of such classes by 


CCr(T’), 
and its elements by c. Then I’ is the disjoint union of the classes 
c€ CCy(T’). 


We apply these general considerations to our concrete case of SL2(C) and the 
heat gaussian, together with the heat kernel. We decompose IT into two e(T)- 
invariant subsets. We define the standard cuspidal subgroup of I to be the subgroup 
of upper triangular matrices in I’, denoted by 49. We define the cuspidal and non- 
cuspidal subsets by 


CusT or Teuy = e(")loo andits complement NCI = T —Tqs. 


The trace K} can then be decomposed into two series, non-cuspidal and cuspidal 
respectively, 


KN @2= DY) Kilyzz) and KS = DY) K(yz,2), 
y€NCusl yeCus T 


so we can rewrite the eigenfunction expansion of K) in the form: 


Theorem 2.2’. 


KN (z, 2) + KO (z,2) 


co 


Cc 
=e Wi OP+ ater f EC + ir. oPar. 
—oo 


j=l 


This is the form of the eigenfunction expansion which we shall use in the next 
section. On the left, the two sums KNC and Ke are inverted theta series with 
variable coefficients. On the right, we have a theta series with variable coefficients, 
and a theta integral, also with the variable z as a parameter. The series converge 
absolutely. 
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We want to integrate the eigenfunction expansion term-by-term over ['\G, which 
has finite volume. Formally, this will give us a theta inversion identity. The main 
problem is that out of five terms, the integral of two of them is not convergent. 
Before explaining what to do with them, we first describe what the integration gives 
in the two cases for which the integral is convergent, namely the non-cuspidal series 
on the left and the theta series with variable coefficients on the right. 

Having picked {y;} to be an orthonormal basis of L2.,,([\G/K), using the fact 
that [\G/K has finite measure, there is no problem integrating the series, and we 
get a theta series with constant coefficients, 


(1) Gow () = Doe", 


j=l 


where we remind the reader that —A ; is an eigenvalue of Casimir on L2,,([\G/K). 


cus 


The integral of the non-cuspidal trace of the heat kernel is absolutely convergent, 
and can be computed explicitly, giving an inverted theta series: 


(2) / KN (z, dpe) =e (41) 2 ON (1/2) 
r\G 


where @™° is the non-cuspidal inverted theta series using [’’ = NCusI’: 


(2a) ON 1/1) = D> agents rel /4t, 


c€CCr(I’) 


with constants a,, | log b,|* that are determined explicitly. Cf. [JoL 03], [JoL 04]. 
Specifically, let y, be a representative of c, and I’, the centralizer of an element 
y € Tl. Then b, = A-polar component of y,, and 


i6e 2 (20)~3 
a= 02 |2e0l- 7-219 ? 
* ~ |Tor(c)| Ute = HEP 
where 
Tor(c)| = order of the torsion group of I,,., which is finite; 
g p Ve 
Ne = eigenvalue of elements in c; 


Nco = eigenvalue of absolute value >1 for a primitive element of I,, 
(which is infinite cyclic modulo the torsion group). 
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This explicit determination can actually take several forms. The form we use in 
the above reference is in terms of invariants associated with each conjugacy class 
in the group I’. There are different ways of expressing these completely in terms 
of number-theoretic invariants over the rational numbers or over Z[i]. These two 
alternatives are reminiscent of theories that establish a bijection between conjugacy 
classes in a Galois group and objects associated directly with the base (class field 
theory, covering space theory, etc). This direction deserves a separate treatment 
elsewhere. It connects with the work of Zagier [Zag 79], [Zag 82], Szmidt [Szm 83], 
[Szm 87], and Venkov [Ven 73]. 

This leaves the cuspidal trace and the Eisenstein term to be dealt with. Their 
integrals over [\G are divergent. However, both of these divergences are “the 
same” in the following precise sense. There is a natural fundamental domain F 
for [\G/K = I\hs. This domain has a tube structure in terms of the coordinates 
(x, y),x €C, y > 0; precisely, 


O<m< xitxgty? 21. 


1 
oy 
Let F(V) be the subset of F consisting of those points such that y S Y for Y large. 
Let 
oo 
Eis,(Y) = = 7 i e204 ECL + ir, z|/?drdu(2), 

F(Y) —9° 

and 
Cus ;(Y) = / K“ (z, z)du(z). 
F(Y) 

Theorem 3.1. Eis;(Y) and Cus ;(Y) have asymptotic expansions of the form 


c(t) log Y + c2(t) + o(1) for Y > oo, 


the factor c,(t) being the same for both functions, namely 


e2! 


c(t) = Qnty?" 


Therefore the term c;(t) log Y causing divergence cancels, and we can integrate 
the rest over all of F (representing [\G/K) letting Y — oo. What results are a 
theta integral and an inverted theta series respectively, 


Oeis(t) = lim [Bis,(¥) — ci(t) log ¥] 


es (1/t) = jim [Cus ,(Y) — c(t) log Y]. 
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These are called the regularized Eisenstein and cuspidal terms respectively. For the 
regularized Eisenstein term we get 


CO 


(3) Ogis(t) = —e77! + = / e204 HG! /ou)(1 +ir)dr, 


—oo 
where, putting F = Q(i), the function ¢); is defined by 


Age(2—s) 


gou(s) = ~ Aeg(s) 


and A€p(s) = 2° (s)or(s). 


The use of A is classical to denote the “completed” zeta function, with its fudge 
factors needed for the functional equation. Thus the Riemann—Dedekind zeta 
function appears! 

Finally, the regularized cuspidal inverted theta series is computed. For this, we 
let 


wiry = (G1) and fr) = Blur 


With this, we compute explicit constants a1, a2, a3 and a, such that 


(4) es (1/t) =a / Ki()rdr + a | Si(r) log(r)rdr 
0 0 


+a3 / Si (r) log (=) rdr + aq. 
0 


The constants a), a2, a3, and a, are explicitly computed using classical known 
numbers, meaning rational numbers, z, log2, logz, and y which is the Euler 
constant, as well as an Euler constant yqii) associated to the number field Q(i) (see 
also [Szm 83] and [Ven 73]). As we show, the appearance of the Euler constants 
comes from the asymptotic expansion of the summation of the appropriate harmonic 
series, either for the integers or the Gauss integers. An elementary exercise in 
analytic number theory connects the asymptotic expansion of harmonic series with 
special values of the corresponding zeta function. On a deeper level, the integrals in 
(4) can be explicitly evaluated using the general Parseval formula from [JoL 93b], 
which expresses the series @°'%(1/t) as integrals of classical functions times 
(euclidean) gaussinas. 
In summary, we thus obtain 


Theorem 3.2. The theta relation holds: 


eo (4t)~ 2 ONC (1/1) + OS (1/1) = Ocus (t) + 1 + Opis(t). 
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Readers may compare our explicit theta relation with Arthur’s version of Sel- 
berg’s trace formula [Art 74] in the context of representation theory, and adelization, 
with test functions having compact support (Assumption 3.5). See also Kubota 
[Kub 73], Appendix; [Zag 79] and [Szm 83]. 

We are ready to take the fifth step, which is to take the Gauss transform to 
yield a function L(s) satisfying a functional equation. Actually, this function is a 
logarithmic derivative 

L(s) = Z'/Z(s). 


If one carries out the above procedure with SL2(R)/K2(R) and a co-compact 
discrete subgroup I’, what one gets for Z is essentially the Selberg zeta function for 
a compact Riemann surface. This follows from a theorem of McKean [McK 72]. 

It is relevant to note here a comment of Iwaniec [Iwa 95]. He writes down the 
functional equation of the Selberg zeta function in multiplicative form (10.40) 


Z(s) = ¥(s)Z(1 — 5), 


and then says on pp. 169: 


If you wish, the Selberg zeta-function satisfies an analogue of the Riemann 
hypothesis. However, the analogy with the Riemann zeta function is superficial. 
First of all, it fails badly when it comes to development into Dirichlet’s series. 
Furthermore, the functional equation (10.40) resists any decent interpretation as 
a kind of Poisson’s summation principle. 


Readers can evaluate for themselves whether “the analogy with the Riemann zeta 
function is superficial.’ Whether decent or indecent, our procedure shows the way 
to get systematically a construction of zeta functions in the most general case of 
semisimple or reductive Lie groups. 

One can then apply the Gauss transform [JoL 94], already mentioned at the end 
of Section 1, to get a zeta object. We return to this below in higher rank. 


4 SL, and Eisenstein series 


We shall describe conjecturally the direction we are taking. We consider “ladders” 
of spaces of type [\G/ K, with appropriate groups G. To avoid fancier terminology, 
we describe some conjectures about the way certain spaces I’, \G,,/K,, fit into each 
other in the concrete case when 


G, =SL,(C), K, = unitary subgroup, IT, = SL, (Z[i]). 
We expect these to be both typical examples and also controlling the more general 


context with semisimple or reductive groups, in a manner which will be discussed 
in the last section. 
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We start with the fact that SL,, has the same type of Iwasawa decomposition that we 
saw on SL. We let G, = SL, (C) but usually omit the index n. Let 


U = subgroup of upper triangular unipotent matrices (1’s on the diagonal). 
A = subgroup of diagonal matrices with components a; > 0 fori = 1,...,n. 
K = subgroup of unitary matrices. 


Then the product map U x A x K + UAK = G isa bijection, called the Iwasawa 
decomposition. 

As on SL, we let B be the real trace form on the Lie algebra g of G, consisting 
of the matrices of trace 0. 

In addition, we have the Lie algebras n,a, €, where n is the vector space of 
strictly upper triangular matrices, a is the space of diagonal matrices, € is the 
space of skew-hermitian matrices, all of them with trace 0. We have the direct sum 
decomposition 


gH=n+at+eé. 


The conjugation action of G on itself induces a conjugation action (functorially) 
on g. Its restriction to A induces an action ¢,(A) on n, which is completely 
reducible into eigenspaces of R-dimension 2, with eigencharacters. In fact, let 
E;;(@i < j) be the matrix with (ij )-component 1 and all other components 0. Then 
Ej; ,1£;; (complex case!) are an R-basis of an eigenspace, and the corresponding 
eigencharacter y;; 1s determined by 


aEj;ja' = (a; /a;)Eij. that is Xij (a) = aj /aj. 


The conjugation action of A on n corresponds to the regular Lie representation of 
a on n, that is the homomorphism reg : a — End(n) such that reg(H)Z = [H, Z]. 
Thus corresponding to the Iwasawa decompositionG = UAK,g = n+a+€, we let 


R(n) = set of (a, n)-characters, also called the regular characters, 


these being the characters occurring in the semisimple decomposition of n over the 
Lie-regular action of a. These characters are the a = aj; such that for a diagonal 
matrix H = diag(h,,...,h,) we have 


aj; (1) = h; =hy, 
The multiplicative characters can be indexed by the additive ones, namely, 
a(log a) 


X = Xa and Yoq(a) = € so that y;;(a) = a”. 
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We let 
S(n) = subset of what are called the simple characters, 0; ;+1, (i = 1,...,1r). 


Every regular character is a linear combination with positive integer coefficients of 
simple characters. The simple characters form a basis of the dual space a. 

The theory of SL,, is built up from subgroups SL, with m < n as follows. Given 
an integer n = 2, let P denote a partition of n, that is 


n=n+...+n,;+41, letting r=rp 


with positive integers 11,...,”,;41.If£r = n—1, we deal with the maximal partition. 
We consider blocks of n; x n; matrices along the diagonal, with indices i ranging 
from 1 tor + 1. We let 


Up = subgroup of the unipotent triangular group with nonzero elements strictly 
above the square blocks, except for the diagonal element equal to 1. 


Ap = subgroup of the diagonal group A with positive diagonal elements that are 
constant in each block, the whole matrix having determinant 1. 


ry 


tI r+ 
Gp= SL, = [] Gn, = direct product of the block groups. 


i=1 i=l 


Kp = [| Kn, = unitary subgroup of Gp. 


i=1 


Thus both Gp and Ap have product structure, which we may write each as 


Gp = diag(Gy,,...,Gn,,,) and Ap = diag(aiIn,,...,4r411n,4,)- 
The components a; (j = 1,...,7 + 1) are subject to the determinant condition 
r+1 


i 
l]4@ =1. 
j=l 


We define a standard reduced parabolic subgroup of H to be a subgroup of the 
form 
P=UpApGp _ alsowritten P = UpApGp. 


This is a subgroup. Indeed, Gp and Ap centralize each other. Furthermore, Ap and 
Gp normalize Up, so P is a subgroup of G. The analysis we are considering on 
G/K is built up inductively from these subgroups. 

Let [ = [,, = SL, (Z[i]). Then P is a discrete subgroup of G, and the 
homogeneous space 


has finite volume for its Haar measure, i.e., the G-invariant measure induced from 
the action of G. 
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Let H be a subgroup of G. We use the notation 
Ty =TO4H. 


Thus we obtain the subgroups 'p, [y,, G,. Note that I'4, is the trivial group. For 
a reduced standard parabolic subgroup P as above, we have the semidirect product 
decomposition 

Tp =Tu,Te>.- 


Forn = 2, the group U is abelian. For n > 2, itis not abelian, but it is decomposable 
into a Jordan—Hélder sequence of abelian groups, actually vector groups. In any 
case, 'y\U and so l'y,\Up is compact, and behaves like a “non-abelian torus”, 
even though it is not even a group. 

The groups Ap, Gp, Kg,, UG, are block groups of the same type as A, G, K,U 
respectiely. We have their Lie algebras 


Np = Lie(Up), ap = Lie(Ap), GGp = Lie(Gp), tgp = Lie(KG,). 


One obtains the corresponding relevant characters, for instance, 


R(ng,) = subset of characters a € R(n) such that a(ap) = 0, or equivalently 
a“ = 1| foralla € Ap. 


R(np) = subset of characters a € R(n) occurring in the a-semisimple 


decomposition 
Np = ce NpPa, 


i.e., such that the w-eigenspaces np» are # 0. We have the disjoint union 
Rin) = R(nep) UR(np). 
We define the traces 


T=%Tr= >» m(a)a and 5(a) = et (loga) 


a€R(n) 
and 
Tp= > m(a)a and Sp(a) = e? "8%, 
aER (np) 
The half traces 


1 
p= at and similarly pp and pg, 


play a significant role, as lying the the middle of a “critical strip” familiar in classical 
contexts of analytic number theory. 
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We are ready for the main course, which concerns the Eisenstein series. Let P be 
a reduced standard parabolic of G = SL,(C), and Tl = SL, (Z[i]). Let y be a 
character on Ap. For € € anc (complexification of the real dual space of ap), we 
let y¢ be the character of Ap such that fora € Ap we have y;¢(a) = a‘. The 
product map 


Up x Ap X Gp/Kgp > G/K_ givenby (u,a,gKG,p) uagk, 
is a differential isomorphism. Therefore we have its three projections on the three 


factors, denoted by putting any one of them as an index, for instance z4,. We define 
the character Eisenstein series by 


II 


Ep(xc)(2) = Ep(,2) 


> (ai, 


yelp\T 


Yd) xe(2ar)- 


yelp\T 


II 


We must now describe a half space of convergence. We have remarked that the 
simple characters {a,...,@,} (r =n — 1) forma basis of aY. Similarly the simple 
characters @p1,...,@p,, form a basis of io. We let apy, Son OD rp be the dual 
basis of ay relative to the trace form originally given in a, restricted to ap, and 
inducing a form on ay, in the natural way. A real character  € ay is defined to be 
positive, written & > 0, if 


g= So oie; with o; > 0 for all i. 


This notion of positivity defines a partial ordering on ay. A complex character ¢ € 
ay ¢ can be written 


€=€&+id, where &,A € a} are real characters. 


We let & = Re(¢) and A = Im(¢). We can write ¢ as a linear combination 
c= >> sialp; with s; € C. 


Then Re(¢) > 0 is equivalent with Re(s;) > 0 for all 7. 


Theorem 4.1. For Re(¢) > 2pp, the Eisenstein series Ep(¢) is absolutely 
convergent, uniformly for z in a compact subset of G/K. 


This theorem for a much wider class of groups (semsimple) is proved in 
Langlands [Lgld 76], see also [Har 68]. Langlands’ monumental work proves the 
meromorphic continuation and functional equation in this general case, leaving 
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many details to the reader, so access is difficult. The attempt to fill a needed 
complement [MoW 94] does not really make Langlands’ theory easily accessible. 
One of the participants of the seminar giving rise to this attempt is referred to in 
the introduction as holding the opinion that “the purpose of the seminar was to 
render obscure what is not so clear.’ So foundational material to these theories is 
not yet in a form that provides relatively easy access, which is one reason why we 
are concentrating on the basic special case of SL,. It is not the only reason; see the 
comments at the end. 

Let Fo be a function on [¢,\Gp/Ke,. Let y = xz with Re(f) > 2pp. We 
define the Fo-twisted Eisenstein series E p(Fo,¢) by the series 


Ep(Po.O@= D> Follvaxc, 1D sp: 


yelp\C 


where we abbreviated Gp / Kg, = Xc,. The main twisting function we shall use is 
the heat gaussian or heat kernel, on which we now comment. 

First, on SLo, the group Gp is the trivial group for the standard parabolic UA, so 
the only way to define the heat kernel on Gp in this case is the constant 1. Twisting 
is not visible on SLzg. It is a construction which takes hold in higher dimension. 


The heat kernel 


We fix the real trace form on the Lie algebra of SL,(C). This determines a 
Riemannian metric on G/ K. We can define the Casimir operator exactly as we did 
on SLz, via a basis and the dual basis of the Lie algebra, and the corresponding 
invariant differential operator, again denoted by w. The heat operator is then 


H,; = —@, + 0;. 


We may also fix a Haar measure, usually taken to be the one associated with 
the metric volume form, but it is useful to allow the scalings depending on 
different choices of Haar measures, as we did on SL. The heat gaussian is 
then defined as a Weierstrass—Dirac family satisfying the heat equation. A general 
theorem of Dodziuk [Dod 83] guarantees its existence and uniqueness on a complete 
Riemannian manifolds with Ricci curvature bounded from below. However, the heat 
kernel can be reached on spaces G/K via spherical inversion theory. It was so 
determined and analyzed by Gangolli in the general case of semisimple Lie groups 
[Gan 68]. We carry this out in a self-contained way for SL2(C) in [JoL 03a]. In the 
complex case, it is obtained from a specific formula for the heat gaussian, which is 
the K-bi-invariant function such that for all b € A under Gangolli’s normalizations, 
and a bilinear form BY, 


eo llogal?/4t .—p7t I 20 a(log b) 
BY(a,t) sinh(a(log b)) 


1 
g,(b) = (4st )sim(G/K)/2 
aeER(n) 
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where p* = BY(p, p), and in the present case, dimG/K = n? — 1. The formula 
is written in such a way that it is the Gangolli formula in the most general case. 
Different normalizations may introduce a constant in front, as when we pick the 
Iwasawa Haar measure for SL». In any case, gaussians are defined as K -bi-invariant 
functions given on b € A by the formula 


p-llog dP /e Tl a(log b) 

aye sinh(a (log b)) 
with c > 0, and positive multiples of such functions. We may then first satisfy the 
Weierstrass-Dirac property by dividing such functions by their total integral, and 
finally adjust the remaining constant so the heat equation is satisfied, in a manner 
entirely like the one we described for SL2, thus obtaining the heat gaussian g,. One 
then defines the heat kernel by the universal recipe 


K,(z,w) = g,(zw). 


The heat kernel on a product space being the (tensor) product of the heat kernel 
on the factors, we have a heat kernel on Gp for each P. We have already commented 
on the convention in the completely degenerate case of trivial Gp. Since the heat 
kernel is a function of two variables, we are led to define the two-character heat 
Eisenstein series 


Eprx(t, 61, 62,2, W) 


QT 2 
= YOK nd xep (2) x6) (1D tp 2), 
vi.y2€0p\P 


In the simplest case of SL2, the heat Eisenstein series collapses to the product of 
two Eisenstein series as defined in Section 2, namely for P = UA, 


Eprxys(1, 62.2, W) = E01, E(%, w). 


There is no K or t. 

For simplicity, one may omit the subscripts XG, and Ap, viewing the heat kernel 
Kg, and the characters as defined on G/K via the projections on Xg, and Ap 
respectively. 

Let Fp be the function defined by 


Fp(t,6,€,2,W) = Kxo, (ap Wap Zip Wp 


For any eigenfunction F of the heat operator, with eigenvalue A, we define its heated 
function (satisfying the heat equation) by 


Ft=e"F, 
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Theorem 4.2. The function Fp in the variables (t,z) or (t, w) is an eigenfunction 
of the Heat operator, with eigenvalue —ev(, x) (minus the eigenvalue of Casimir 
on the character x;). This eigenvalue is explicitly 


Ape = ev(@, Xr) = BY (6,6) — BY (E, tp). 


The heated function F} satisfies the heat equation in the variables (t,z) and 

t,w). The Eisenstein series is an eigenfunction of the heat operator, with the same 
§ p 

eigenvalue as above, and its heated function 


Eprett, & é, Zw)= e Ps! Eorx(t, i? é zw) 


satisfies the heat equation. 


5 Conjectures on eigenfunction expansion, theta inversion 
and zetas 


In this section, we reproduce conjectures from [JoL 03]. 

By BC we abbreviate the property of being bounded continuous. Let f € 
BC(1‘\G/K). Let P be a standard parabolic subgroup of G. We define f to be 
P-cuspidal if 


/ f(uz)du = 0 forall zEG. 


Tup \Up 


Let £ > 2xp = Tp. We say that f is (P,&)-admissible if fEprx«(f, 6) is in 
L'(T\G/K) for Re(£) = &. We make the same definition for all € after assuming 
the meromorphic continuation of the Eisenstein series. 


Suppose f is (P,&)-admissible. The property that f is P-cuspidal is equivalent 
with the property that for Re(¢) = &, 


Eprx(é,0)* f =0. 


The above property is still phrased within the half space of absolute convergence 
of the Eisenstein series. We now need to assume the existence of a meromorphic 
continuation, so that we can work on the critical space Re(¢) = pp itself. 

We define L7,([\G/K) to be the closure of the subspace of L? generated 
by eigenfunctions of Casimir, and call this subspace the discrete part of L*. We 
define a function to be cuspidal if it is P-cuspidal for all P, and call the space 
of such functions the cuspidal space. It is contained in the discrete part of L”. Its 
orthonormal complement will be called the residual space L?, ((\G/K). 

For n = 2, the discrete part of L? is the cuspidal subspace plus the con- 
stants. For n > 2, it involves more, namely a bigger residual part, for which 
the only available basic reference is the “jungle” of Langlands [Lgld 76], §7. 
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Gelfand—Piatetski-Shapiro proved that convolution on the cuspidal space with an 
L! function is a compact operator, and Borel—Garland [BoG 83] extended this to 
the full discrete part. 


Conjecture 5.1 Suppose that for all € with Re(¢) = pp,t > 0,z € G we have 
(Eprx * f)(t,6,6,2 = 0. 


Then f is in the P-cuspidal + discrete subspace. 


The convolution is on '\G. The integral implicit in this convolution is taken over 
the second I'\G variable. As for the test function, one needs it in a space which will 
include the heat kernel, e.g. the space of gaussians. 

Pushing the need for analytic continuation, we define the anti-discrete kernel 


Jer pp t (ZW) = / Fe rx(t,6,,z, w)dIm(6) 
Re()=pp 


= / e 4?! Fork (t, 6 €,z, w)dIm(0). 


Re(t)=pp 
Conjecture 5.2 The mapt +> Jpr pp 4t(z,. W) satisfies the semigroup property under 
convolution on 1\G. For f in the appropriate space (containing the gaussians, 
possibly (P, &)-admissible for pp < & S (2+ €)pp) the function 
fe — lim CpIPLpp.t * if 
t>0 
is in the P-cuspidal subspace + residual space, for some constant Cp. 
Considering the anti-Eisenstein product 
[]@ - J») 


taken over all P, we are led to: 


Conjecture 5.3 There exist real numbers c’, such that the function 
pik: 
a y cp lim J. * 
f = Ps PJ .pp.t ‘a 


is in the cuspidal + residual = discrete subspace. 


We are now in a position to state the conjectural form of the eigenfunction 
expansion for the heat kernel. Let {y;} be an orthonormal basis for the discrete 
part of L?(['\G/K) consisting of Casimir eigenfunctions, so oj = Aj; Yj. 


The heat kernel, theta inversion and zetas on '\G/K 297 


Conjecture 5.4 For X = G/K, there exist constants c', such that 


Kx(.z.w) = lev @Y¥j WW) 
i#0 


+o + Sep es! Eprx(t.6,6,2, w)dIm((). 


P  Re(t)=pp 


In the above formulation, we took 7% for 7 = 0 to be a constant function. The 
value of the constants co and a depend on various normalizations. However, no 
matter what, co has to be 1/vol(['\G). 

Conjecture 5.4 is the conjectured formula for the eigenfunction expansion of the 
heat kernel. 

We have reached the stage of Theorem 2.2 on SL, (C). Then one has to carry out 
the regularization, the cancellation procedure between the cuspidal part Ks of the 
heat kernel and the Eisenstein integral, in order to reach the theta inversion formula, 
thus completing the first four steps listed in the case of SLp. 

The fifth step will consist in taking the Gauss transform to yield zeta objects with 
an additive structure. This will exhibit the ladder structure, in which the fudge terms 
of the functional equation of the zeta object L,, at level n will be mostly the zeta 
objects L,, at lower levels m < n, including the vey bottom object which will be 
¢1,;/¢11 (cf. Section 3 (3)), thus bringing in explicitly the number theoretic zeta. 
The other fudge terms will be gamma type functions, powers of 2, exponentials. 
The general theory of the Gauss transform from [JoL 94] Chapter V will apply. 

In this way, the theory of explicit formulas for regularized series merges with the 
theory of eigenfunction expansions on reductive Lie groups, starting with what first 
appears as a special case, the SL, ladder. This broader context was the principal 
motivation for the axiomatization that we started in [JoL 93], [JoL 94] to make 
the entire set up (regularized products or series and explicit formulas) applicable 
simultaneously to the cases of classical analytic number theory and the geometric 
cases that arise from groups like SL,, (reductive groups). 

Note that at levels higher than n = 2, the main contributions to the fudge 
terms arise from the parabolics, which are like the original group SL,, but of lower 
dimension or rank. Thus the fudge terms, not factors because we are carrying on 
in an additive setting, correspond to zeta objects associated with lower steps in the 
ladder, in the present case the SL,, ladder. 

We view spec(Z|i]) as lying at the bottom of the geometric ladder, that is at 
the first level. The existence of such a ladder shifts the focus of attention from a 
single level to the way all levels affect each other, especially the way the bottom 
level interacts with all levels (providing a fudge factor), emphasizing the number- 
theoretic significance. 

The G,,/K,, having negative curvature, it is fruitful to view spec(Z[i]), or spec(o) 
for the ring of integers of a number field, as having “negative curvature,’ whatever 
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that means. This point of view led us to conjecture that every abelian subgroup of 
the Galois group of the algebraic closure of Q is topologically cyclic, in analogy 
with Preissmann’s theorem in differential geometry. It then turned out that the 
above statement is a theorem proved by Geyer in the late 60s [Gey 69], thus giving 
a strong indication that the heuristic point of view may have a more substantial 
content. 

In any case, we may summarize the above pattern with 


Gn = SL, (C), X, = Gn/ Kn, T;, = SL, (Zii]) 


assigning zeta object to [,,\X, = T',\Gy/ Ky by the five steps procedure: 


e Start with the heat kernel on X,,. 

¢ Periodize by T, to P,\Xn. 

e Give explicitly the eigenfunction expansion of the heat kernel. 

¢ Regularize the divergent terms to get a theta inversion relation, by integrating 
over I), \Gy. 

e Apply the Gauss transform. 


6 Further connections 


Representation theory per se forms a motivating force for a whole establishment. 
We are motivated differently, namely by the development of zeta functions via 
theta inversion relations, and their analysis via regularized products, regularized 
series, and explicit formulas. It has been realized in different contexts (Selberg 
trace formula [Sel 56], Gangolli’s construction of the heat kernel on general G/ K’s 
with co-compact discrete I") that theta inversion formulas can be viewed as part 
of a much larger context, stemming from the theory of semisimple Lie groups, 
symmetric spaces, and the heat kernel. Roughly speaking, the general setting for 
what we described above is that of such groups which have a structure like that 
of SL, but more (or occasionally much more) complicated. If G is such a group, 
its associated symmetric space is G/K, where K is the (compact) subgroup in the 
Iwasawa decomposition G = UAK. One essential aspect of these spaces is that they 
are Cartan—Hadamard: complete, simply connected, with seminegative curvature in 
the language of differential geometry. The subgroup I is a discrete subgroup, which 
may or may not be such that [\\G is compact, but always volume([I'\G) is assumed 
finite. Of special interest are “‘arithmetic” subgroups such as SL,(Z) in SL,,(R) 
and SL,, (Z[i]) in SL, (C), which are not co-compact and introduce number theory 
in various ways, from the bottom up, as we saw in Section 5. 

In the Iwasawa decomposition, U is unipotent (its elements are exponentials of 
nilpotent elements in the Lie algebra), and in particular solvable. The group A is 
isomorphic to a product of positive real multiplicative groups. The subgroup K is 
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maximal compact, or better unitary for an appropriate scalar product just as on SL,,. 
If n, a, € are the respective Lie algebras, then n is stable under the conjugation action 
by A, and thus U is normal in UA. The Lie algebra n decomposes semismply into 
1-dimensional subspaces, giving rise to eigencharacters a just as for SL,,. A positive 
definite scalar product is given a priori on a, and there is a polar decomposition 
G = KAK, just as for SL,, so that for b € A, one has logb € a and the norm 
| log b| is defined. 


Gaussians. One may thus define gaussians when G is a complex group just as 
we did in the case of SL,,(C). Gangolli [Gan 68] showed that the heat kernel in 
this complex case is a gaussian with the formula we gave in Section 4. Gaussians 
actually serve (at least) three purposes: 


¢ One of them is to serve as test functions in a new general development of 
the theory of semisimple or reductive Lie groups. Indeed, we expect the space 
generated by the gaussians to be dense in anything one wants (for the case of 
SL», see [JoL 03b]). 

e Another is to provide the basis for explicit formulas in this theory. 

e The third is to lead immediately into the heat kernel. 


The controlling effect of the complex case. Gelfand—Naimark first treated the 
representation theory in the complex case of the classical groups [GeN 50/57], and 
Harish-Chandra completed this for all complex groups, and then all real groups 
{Har 54], [Har 58a] and [Har 58b], by means of the Harish-Chandra series, taking 
his motivation from linear differential equations. This method was followed in the 
standard references [Hel 84], [GaV 88], and also in [JoL Ola] for SL,. However, in 
Chapter XII of [JoL Ola] for spherical inversion, we suggested the possibility of an 
entirely different approach to the general case, having its origins in the Flensted- 
Jensen method of reduction to the complex case [FIJ 78], [FlJ 86]. This program 
is in the process of being carried out, using the normal transform and its relation 
to spherical inversion and the heat gaussian, as investigated in collaboration with 
A. Sinton. As we developed, the Flensted-Jensen method can be placed in a much 
more general context of a totally geodesic embedding or an Iwasawa embedding 
(compatible with the Iwasawa decomposition up to conjugacy) of one space G1/K, 
into another G/K. Then the analysis on G,/K, is related to the analysis on 
G/K by a commutative diagram, via a projection operator from the larger space 
to the smaller, namely integrating over the normal directions. For instance, the 
heat kernel of G,/K, is so obtained form the heat kernel on G/K, and so is the 
theory of spherical inversion. This also gives a rapid insight into the real case as a 
“homomorphic image” of the complex case. 

Since any semisimple Lie group can be naturally embedded into some SL,, (C) 
via its Killing representation (conjugation representation on its Lie algebra), it 
thus appears that SL, (C) is not only an example but a controlling object for all 
semisimple Lie groups. Thus the pedagogically effective approach of working out 
the concrete case of SL,, (C) first is also mathematically effective to get at the general 
case. 
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Ladders. The natural way groups SL, are embedded in each other is part of a more 
general system giving rise to other kinds of what we call ladders. Each symmetric 
space G,/K, is embedded in a G,+1/K,+1, in a totally geodesic way, preserving 
the Iwasawa decomposition (after possibly a conjugation). Thus we may have a 
sequence of geometric objects which can be displayed vertically as a ladder: 


Gn4i/Kn41 
Gn/ Kn 
Gn—1/Kn-1 


On the other hand, we have the ladder of associated zeta functions L,,(s), according 
to the general five steps summarized at the end of Section 5. The L-notation 
suggests logarithmic derivatives as well as classical L-functions. The fudge terms, 
written additively in our set up so we do not say fudge factors, in the additive 
functional equation of our zeta objects (logarithmic derivatives), will conjecturally 
come mainly from the zeta functions of lower level in the ladder. Other fudge 
factors will include higher-dimensional versions of gamma functions. Thus we 
have a zeta ladder in parallel to the ladder of spaces. The spaces G,,/K, are not 
compact, but can be compactified by the spaces Gj, /Ki, with m < n. A special 
case arises when these come from parabolic subgroups. Thus the occurrence of 
functions L,,(s) as fudge terms for L,(s) reflects the geometric construction of 
compactification. 

Furthermore, a new connection arises between classical number-theoretic objects 
and objects coming from geometry and analysis, because classical Dedekind zeta 
functions will occur as fudge factors of geometric zetas. In particular, to so-called 
“trivial” zeros or poles at a given level, i.e. those belonging to the fudge factors, are 
the main zeros or poles of lower levels. Thus the zeros of the ordinary Riemann zeta 
are “trivial” zeros for higher level zetas. More appropriately, they might be called 
fudge zeros for higher levels. 


Connections with geometry. Similar ladders should occur with certain types of 
spaces arising from algebraic geometry and differential geometry, in various ways. 

(a) First, moduli spaces in algebraic geometry have a tendency to be of type 
I'\G/K orto be naturally embeddable to some ['\\G/ K (G reductive or semisimple, 
and K the unitary subgroup). For instance, the moduli space of curves of genus 
=2 is embedded in the moduli space of its associated jacobian because of Torelli’s 
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theorem (two curves are isomorphic if and only if their jacobians are isomorphic). 
As for abelian varieties, their moduli spaces are obtained as quotients of G/ K where 
G is a symplectic group Sp, (with g equal to the dimension). 

Another possible ladder over the moduli space of curves of genus | starts with 
the moduli space of K3-surfaces, which is a [\G/K corresponding to the group 
G = SOo(2, 19), followed by Calabi-Yau manifolds moduli spaces, with their more 
complicated moduli structure embeddable in [\G/K’s, or also possibly involving 
(b) below. Finally, we mention the moduli ladder of forms of higher degree as in 
a paper of Jordan [Jor 1880]. Thus analytic properties of Mellin(@) (essentially a 
spectral zeta) or of Gauss(@) (a broader context for Selberg’s zeta), associated to 
such spaces will conjecturally get related to the algebraic geometry and differential 
geometry of such spaces in ways in which both complement those in the past, and 
new ways in the future. 

In any case, the geometric ladder and the ladder of zeta functions reflect 
each other, thereby interlocking the theory of spaces coming from algebraic and 
differential geometry, with analysis and a framework whose origins to a large extent 
stem from analytic number theory. On the other hand, for some purposes, and in any 
case as a necessary preliminary for everything else, the purely analytic aspects have 
to be systematically available. 

(b) There are other manifestations of ladder-like stratifications. A theorem of 
Griffiths [Gri 71] states that given a projective variety V over C, there is a Zariski 
open subset which is a quotient of a bounded domain (of holomorphy), and this 
bounded domain is C® isomorphic to a cell (euclidean space). We propose to go 
further, namely that Zariski open subset can be chosen so that its universal covering 
space is real analytically a G/ K with G semisimple or reductive, so the Zariski open 
subset is a [\G/K with discrete . Thus any projective variety could be stratified 
by aI'\G/K. This gives rise to the possibility of considering the classification of 
varieties or manifolds via stratification structures by such [\G/K. In particular, 
what is the minimal Zariski closed subset of the moduli space of Calabi-Yau mani- 
folds which one has to delete to get the complement expressible as a [\\G/ K? How 
does this symmetric space relate to the symmetric space in which the moduli space 
i naturally embedded? Same question for the moduli space of curves of genus >2. 

Topologists have concentrated on the classification problem via connected sums, 
but we find indications that the stratification structure and its connection with 
eigenexpansion analysis deserves greater attention. Thurston’s conjecture for 3- 
manifolds fits into this ladder scheme. 

Having a stratification as suggested above, one may then define an associated zeta 
function following the five steps listed previously, and related the analytic properties 
of these zeta functions with the algebraic-differential geometry of the variety. 

(c) The trace formula and the spectral zeta function in connection with index the- 
orems, have some history dating back to the seventies and eighties. We mention only 
a few papers: Atiyah—Bott—Patodi [AtBP 73], Atiyah—Donnelly—Singer [AtDS 83], 
Barbasch—Moscovici [BaM 83], Miiller [Mul 83], [Mul 84], [Mul 87]. For a more 
complete bibliography, cf. Miiller’s Springer Lecture Notes [Mul 87]. These papers 
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are partly directed toward index theorems and the connection with number-theoretic 
invariants, as in the proof of a conjecture of Hirzebruch in [AtDS 83] and [Mul 87]. 
In retrospect, we would interpret the Atiyah—Donnelly—Singer paper as working on 
two steps of a ladder, with the compactification of one space by another, and the 
index theorem being applied on this compact manifold. A reconsideration of the 
above mentioned papers in light of the present perspective is now in order. 

Readers can compare Miiller’s formula for the heat kernel [Mul 84] Theorem 
4.8 and [Mul 87], (9.5), obtained in the context of functional analysis, with our 
explicit formula. Working as we do in the complex case, where it is possible to use 
an explicit gaussian representation for the heat kernel, and using the Gauss transform 
rather than the Mellin transform, puts a very different slant on the whole subject, and 
allows us to go in a very different direction, starting with the explicit theta inversion 
relation and its Gauss transform. 


Towers of ladders. The structure goes still further. To concentrate on certain aspects 
of analysis in the simplest case of G/K, we already picked the number field Q(i) 
instead of Q itself, so we used Z[i] instead of Z. However, one may consider an 
arbitrary number field, and the Hilbert-Asai symmetric space associated with it 
[Asa 70], [JoL 99]. Thus we may go up a tower of number fields {F,,} (finite 
extensions), and then ladders over these {Gy.n/Kmn}, giving rise to a tower of 
ladders, so a quarter lattice combining even more extensively geometric structures 
with classical number theoretic ones. 

Going up a tower ipso facto introduces questions of number theory. Already for 
quadratic fields, the Eisenstein series gives rise to a whole direction as in [EGM 85], 
[EGM 87], [EGM 98], with the theory of special values. 

However, we place emphasis on the extent to which the geometric structure is 
affecting the number theory by having number-theoretic objects at the bottom of 
geometric ladders. The main new question, as we see it, is how does the existence of 
a rigid sequence of zeta functions up the ladder force certain regularities of behavior 
on the arithmetic bottom? 
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Applications of heat kernels on abelian groups: 
¢(2n), quadratic reciprocity, Bessel integrals 


Anders Karlsson 


In memory of Serge Lang 
Abstract The discussion centers around three applications of heat kernel 
considerations on R, Z and their quotients. These are Euler’s formula for ¢(27), 
Gauss’ quadratic reciprocity law, and the evaluation of certain integrals of Bessel 
functions. Some further applications are mentioned, including the functional equa- 
tion of Riemann’s ¢-function, the reflection formula for the /"-function, and certain 
infinite sums of Bessel functions. 
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1 Introduction 
It was a well-known open problem at the beginning of the 18th century to determine 


the value of 
lo, e) 
3 be 
rc 
k=1 k 


In fact, Wallis and Leibniz failed in their attempts and the question was much 
discussed among the Bernoullis. It was therefore a sensation when the solution came 


A. Karlsson (1) 

Section de Mathématiques, Université de Genéve, 2-4 rue de Li€évre, case postate 64, 1211 
Genéve, Switzerland 

e-mail: anders. Karlsson @unige.ch 


D. Goldfeld et al. (eds.), Number Theory, Analysis and Geometry: 307 
In Memory of Serge Lang, DOI 10.1007/978-1-4614-1260-1_14, 
© Springer Science+Business Media, LLC 2012 


308 A. Karlsson 


in 1734 from the young Euler, who later also found the general formula for ¢(27), 
see Theorem | below. 

Now consider instead the problem of solving quadratic equations mod p. A 
general quadratic equation reduces to studying 


x? = q mod p 


for any two distinct primes p and g. The main theorem for answering when this 
equation has a solution is the quadratic reciprocity law proved by Gauss in 1796; 
see Theorem 3 below. 

It is a striking fact that both these two classic theorems of number theory, on 
the surface so different in character, can be deduced from one single analytical 
formula. We will see this in Sections 3 and 4. The analytical formula in question is 
the classical Poisson—Jacobi theta inversion identity which expresses the heat kernel 
on R/2zZ in two ways. This proof of Gauss’s theorem is known and can be found 
in e.g., [4], while the deduction of Euler’s evaluation of €(2n) appears to be new 
(this proof is analogous to how Selberg’s zeta function with functional equation is 
derived in [17]). 

In Section 6 we will moreover see how to evaluate integrals of Bessel functions 
such as 


a In(t)dt or a In(t) Im (x —t)dt 
0 0 


through a determination of the heat kernel on the space consisting of two(!) points. 

Of course there are several other extraordinary applications of heat kernels and 
theta inversion, even on R; see e.g., [15]. Here I selected the evaluation of ¢(27) 
because the proof is both appealing and suggestive, and I chose to include the case 
of quadratic reciprocity because Lang liked it particularly much and he told me that 
one should try to do the same to every theta inversion in sight. 

As will be clear, the approach is influenced by the ideas of Jorgenson and Lang. 
In Sections 2 and 5 I try to put the material in the framework of their program, where 
R and Z correspond to the lowest (or next to lowest) levels in the ladder structures. 
See [12] and [13] for more details on this. 
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2 Theta inversion on R 


The Poisson summation formula is usually stated in the following way: 


fo = > 7): (1) 


n=—Co n=—oo 
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where f is the Fourier transform of /, an infinitely differentiable function such 
that f and all its derivatives decrease rapidly at infinity, which means that 
lim)x|-+00 |x|" f(x) = 0 for every m, k > 0. Although elegant as this 
formula no doubt is, it comes especially alive when one takes f to be the heat 
kernel on R, 


KGa)= = ear, 
V4at 
Actually, as Lang pointed out to me, two important features are left out in the “roof 
formula” (1) as compared to e.g. (2) below: first, the spectral expansion on the 
quotient present in the proof is hidden, and second, the crucial t-variable structure 
is missing. In more detail, we start with K™(t, x), which we periodize to make it 
2m -periodic in x (see e.g., [15]). As such it has a Fourier series expansion, and one 
has after a computation of Fourier coefficients that 


i =< L << 
—(x+2nn)?/4t _ ft 2 —n7t inx 
e = e é: (2) 
4nt n=—o0o 2 n=—oo 
Now specializing by letting x = 0 one gets the Poisson—Jacobi theta inversion 
formula: 
1 — 2 i 
enn n/t es p> ert, (3) 
Ant n=—-Co 2n n=—oo 


proved for ¢ > 0, but a posteriori valid for Re(t) > 0 since both sides are analytic 
in that region. Riemann attributes this formula to Jacobi, who in turn attributes it to 
Poisson (see [7, p. 15]). Note that (3) is what we would get from (1) with f(x) = 
K®(t, 27x). 

Define the theta function 0(t) = 772 
in a more compact form 


ek Then the identity (3) becomes 


—0o 


1 1 


which explains the name theta inversion. 

In the many applications of these formulas ¢ plays a crucial role. The theorems of 
Euler and Gauss are discussed below, and then there is also the original Riemann’s 
meromorphic continuation and functional equation of his zeta function, which is 
recalled without the proof in Section 3. 

Note that although these theorems of Euler, Gauss, and Riemann are discussed 
in most basic textbooks on number theory (e.g., [3], [7], [11], and [19]), it seems 
that nowhere is it pointed out that, remarkably, all three are consequences of the 
Poisson-Jacobi theta inversion formula. Considering this, we may be well-advised 
to study analogs of this formula more closely, which is what we do in Section 5 
although only to a modest extent. 


310 A. Karlsson 
3 Special values of Riemann’s zeta function 


What follows is a proof of the following theorem: 


Theorem 1 (Euler). For any k > 0, 


cama 1 (20)** Box 


SOR Dae 56m 


n=1 


’ 


where B,, denotes the Bernoulli numbers. 


Recall that the Bernoulli numbers B; are defined via 


x = x* 1 = xk 
aoe Sie eo 
Fae nae >k Oy! 


See [11, Ch. 15] for more information on Bernoulli numbers, and in this reference 
it is also remarked that the theorem above “constitutes one of [Euler’s] most 
remarkable calculations,’ which in Euler’s case does not mean little. Since B, = 
1/6, By = —1/30, and Bs = 1/42, we get for example that (2) = 27/6, 
£(4) = 2*/90, and €(6) = 1°/945. 

Let f be a measurable function such that | f(t)| = O(e') for some b as t > oo. 
The Gauss transform of f(t) following Jorgenson—Lang, see e.g., [16, p. 301] or 
[13, p. 1], is 


Gf(s) = 2s i a f(tje dt 


and is an analytic function in s for Re(s*) > b. From [6, p. 25] one has that the 
Laplace transform of 


1 2 1 
e* /* fora > 0, is eave, 


Jat vo 


Now if we take the Gauss transform of the left-hand side LHS of (3), we get 
for s > 0, by repeatedly interchanging the order of sums and integrals (justified 
by absolute and uniform convergence of the series and integrals in question), 
that 


lo) co 1 rs a 1 
2,2 2 
G(LHS =2 g/t os ‘dt= + —2n|n|s 
core sv ye ° 2 2. 5° 
n=—00 H=H=05 
—21s _ 1 + e72ms 


= e 
_ —2n|n|s _ 
— S 7 =e (ae 


n=—Co 


Applications of heat kernels on abelian groups: ¢(2), quadratic reciprocity, Bessel 311 


The right-hand side RHS becomes 


Is oo lo) 3 5 2s oo —(n?+s7)t 
G(RHS =— i ee ——_— 
(RHS)(s) = ss [ ere an > | ’ 


| 


II 

i) 

y|- 
| led! 8 
8 

ae 

+|8 

eo 


Therefore we have that the Gauss transform of the theta identity on R gives: 
Proposition 1. For real s £ 0, 

I+e" 1 3 2s 
1—e-2"5 On gs? +n?’ 


n=—oo 


We now expand both sides in series expansions in s, for small s > 0: 


1 2s to Dey 6 tL, 2 G/ny 
- see i pte lees wt elie 
PL Seems se+n? ms ms oa or s? ws mS = 1+ (s/n) 


2 


== + 2 Sey 


n=1k=1 


2 = ee 
5 23(S 2) — 


n=1 


On the other hand, in view of the definition of B,,, the left-hand side becomes 


1+ e275 2 1 -2zs 

——— = -l- = = -1l+ —- = 

1 — e72ns e72ms = 1 1S e72ms = 1 
(—275)* 


1 
ee eae | B 
eae = 2 


1 t (Qn)*s™* 
Se ae Bop ———— 
WS = TS d si (2k)! 


Hence for integers k > 0 


3 1 _ GDF Gn) 2 


nk ———«2(2k)! at 


n=1 


which proves the theorem. 
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Let &(s) = 2~*/*'(s/2)€(s). It is worth recalling here that Riemann applied the 
Mellin transform to the theta inversion on R, see [7, pp. 15—16] or [15], proving: 


Theorem 2 (Riemann). The function &(s) admits an analytic continuation for all 
s #0,1 and 


&(s) = €U—s). 
From this and Theorem 1, it follows that 
B N 
¢(1—2n) = ——, 
2n 


and, because of the poles of I’, that at the negative even integers ¢(—2n) = 0. These 
special values were found by Euler in 1749. 


4 Quadratic reciprocity 


We consider the following equation: 
x? = q mod p 


for any two distinct primes p and g. The Legendre symbol 


is defined to be 1 if the above equation has a solution for some integer x and 
—1 otherwise unless g = 0 mod p:p in which case the symbol is 0. The quadratic 
reciprocity law is: 


Theorem 3 (Gauss). For any two distinct odd primes p and q, 


(2) (4) = (Deen, 
q P 


Euler stated the theorem in 1783 but without proof. Legendre wrote only a partial 
proof, and the first correct proof was published by Gauss in 1796. This theorem was 
perhaps Gauss’s favorite in number theory, which is also indicated by the name he 
attached to it: theorema aureum — the golden theorem. 

The proof we present is based on a beautiful formula, due to Schaar from 1848, 
and which is of independent interest. It will here arise as the asymptotics expansion 
in the theta inversion formula for ¢ = ¢ + ip/q, € — 0. We follow Bellman [4], 
who attributes this proof to Landsberg. A similar method of proof was employed by 
Hecke [10] to establish quadratic reciprocity for an arbitrary number field; see also 
[8], [3], and [18]. This might indicate that it is one of the better proofs out of the 
hundred or so published proofs of Gauss’s theorem. 
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Let 
q-1 


S@.q)i= > ere, 


r=0 


Proposition 2. Let p and q be two relatively prime integers. Then 


1 —in/4 
—S(p,q) = S(q, Pp), 
Vd JP 
or written out in full, 
q=l ~in/4 P=} 
i eitk? p/q i > einale 
q k=0 VP /=0 
Proof. Let 
[oe 
O(t) = > et t = 1 + a ae 
k=—00 k=1 


For ¢ > 0 we have 


q-1 


a5 = 
6(e+ip/q) = 142) eth ee ink’ pla - 14257 
k=1 k=0 


foe) 

= 2 = 2 

e ink? p/q : e w(k+lq)-e . 
1=0 


The inner sum can be interpreted as a Riemann sum as ¢ — 0 so that 


Se ee -| en mk+x4)"€ Jy of o(1) = e-” dw + 0(1) 
1=0 0 mq Jt Ink Je 
1 
mgq/e\ 2 
Hence 
1 fT 1 

6 j = 1+ 2——~ | — 1) ) S(p,.g) = — = (S(p, 1 

(e+ip/q)=1+ —z( 5 )) (p,q) ae (p,q) + o(1)) 
ase —> 0. 


On the other hand, start by noting that 


1 1 € . P/G q  .4 2 
= — —1 =E€E —!1 +O tome IF 
t etip/q @+p*/q? e+p/q pp? p — 
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Therefore by the same argument, although with some extra care due to the presence 
of O(e7) above, we get the asymptotics as ¢ > 0 for 
1 1 1 
o(—-)- (S(-4, p) + 0(0)) = = 
e+ ip/q pV 7Eq?/ p? qV TE 


Finally, in view of that 


1 1 = qd 
= _ im/4 
= ———_ =e —+o(1) 
Vt Jet+ip/q \ p 
and comparing the two asymptotics in the theta inversion formula (4), the proposi- 
tion is proved. oO 


(S(q, p) + o(1)). 


Let the quadratic Gauss sum be 


m—1 
G(n,m) = S(2n,m) = Se ei2nr’nfm 


r=0 
We have: 


Lemma 1. Let p and q be two distinct primes. Then 


G(1, pq) = G(p, q)G(q, p). 


Proof. Note that k? p? + 1?q? equals (kp + 1q)? mod pq, so we see that 


G(p,q)G(q, p) = 5 eleatal 5 etalp = > bs pti) ei2nl?q/p 
k=0 1=0 1=0 \k=0 
p-lq-l 
= ei2m(k? p?+1°q?)/pq — G(, pq). 
1=0 k=0 
since kp + /q runs through all the values 0 to pq — 1 mod pq exactly once. oO 


The connection to the Legendre symbol comes next: 


Lemma 2. Let p be an odd prime and assume that p does not divide n. Then 
n 
G(n, p) = (=) CCl By 


Proof. This is a simple calculation keeping in mind that as r runs from 1 to p—1, r? 
mod p goes through all the quadratic residues Q, exactly twice because (p —r)? = 
r* mod p: 
G(n, p) =a ee) ye ei2akn/p 
keQ 
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Now if 7 is a quadratic residue then clearly kn is a quadratic residue, and so 
n 
Ga, p) =14+2)0 e?*™/? = GQ, p) = (“) G(1, p). 
P 
meQ 


On the other hand, ifn is a quadratic nonresidue then kn runs through the quadratic 
nonresidues Q’, and we get 


; ‘ n 
G(n, p) =1+2 » ei2nm|p =-] ee ae = (=) G(, p), 
méeQ’ leQ z 
where the second equality comes from the evaluation of a geometric series: 
p-l 
1+ x ei2rm/p 4. > ei2am/p _ Seer? —0. oO 
méeQ meQ’/ m=0 


We now prove Gauss’s theorem. First we have using Proposition 2 for an odd 
number m that 


G(1,m) = S(2,m) = eal 4 gor) = por—-I"/4 fn, 


In view of the two lemmas we finally get 


(2) (4) _GPaGap)_ _ GU.pqg_ (-1yr-da-b/4 
q/)\p G(l,qg) GU,p) G(1,q)G(., p) 


as required. 


Finally, note that also the two so-called supplements come out directly from 
Proposition 2: 


ee 
a1) _ GO. a) _ COPPA VE _ yen = yye-9r 
Pp} G(l,p) ie fp 


and for an odd prime p, 


) G(2, p) VP gin/4 (1 + eitp/4 4. inp 4 eix9p/4) 
p) G(l,p) — ie-P/A fp 
eit(Pth/4 4 gin(p+l)/4 


_ — (_1)(’-D/8 
7 2j (P-1)?/4 =C) 
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5 Theta inversion on Z 


The heat kernel K“(t, x) on Z is the fundamental solution of 


(4+ x) f(t, x) = 0, 


where i 
Ag(x) = g(x) - 78 = 1) + g(x + 1)). 


It is easily verified that 
K*(t,x) = e*1,(1), 


where x € Z, t > 0, and J is the Bessel function 


iad v+2k 


z 
I,(z) = ; 
@) dX DWKIP(y +k +1) 
(The relation to the more standard J-Bessel funtion is [,,(z) = (—i)”" J;, (iz).) This 
can basically be found in Feller [9, pp. 58-60], see also my paper with Neuhauser 
[14] for a discussion. When passing to a quotient Z/mZ, we obtain the analogy of 
(2), except for a cancellation of the factor e~“, 


[exe) 1 m—1 
~ Fptala) pas ecosnj/m)z+2nijx/m (5) 
m 


k=—0o j=0 


for any z € C and integers x and m > 0, as was proved in [14]. Specializing to 
x = 0, we have the theta inversion formula on Z, 


00. 1 m—l 
» Tkm (z) Se > ecos(2aj/m)z. (6) 
k=—o0o m 7J=0 
or in more perfect analogy with (3), 
iced 1 m—l 
e! 9G) a —2sin®(aj/m)t_ 
> km ( ) 7 pa e 
k=—00 j=0 


The beautiful formula (6) was in fact established earlier by Al-Jarrah, Dempsey, and 
Glasser [2] (compare also with Theorem 9 in [5]) by a very different method. Note 
however that these formulas do not seem to have been noticed previously in the vast 
classical literature on Bessel functions. 

If we take the Gauss transform on this identity (now again multiplied by e~‘) 
it is possible, see [14], to get an explicit formula, which is thus the analog of 
Proposition 1: 
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Proposition 3. For real s 4 0, and m > 0 an integer, 


25 1+ (s? +1 - vst +25?) 1 ml 25 


[s¥ +25? 1 _ (2 41 — Js Ost 23)" armed s? + 2sin?(aj/m) 


This is the logarithmic derivative (up to the factor m) of a Selberg-type zeta 
function Z2/""2 (the analogy coming from [17]). In this way we obtain the following 
[14]: 


m m1 g2 
2?-™ sinh? (S-arecosh v+1 ) = ms? (1 + aie) =; Z4/"Z(5 
2 au 2 sin?(xn/m) (s) 


which holds for any s € C. 
In the case of R, one gets (cf. the remarks on p. 5 in [13]) in an analogous fashion 


lore) 2 
2sinh zs = 2s |] (1 + =) =i ZEB (5), (7) 
nN 


n=1 


This in turn can be recast into the well-known reflection formula due to Euler: 


Proposition 4. Forz€ C\ Z, 


: =I (zl —z). 
sin 1Z 


Proof. Recall that the gamma function can be defined through a Weierstrass product 
(where y is Euler’s constant): 


1 gel i (1+) ew/n 
T(z) n=1 n ; 
In view of this and using (w+ 1) = wI(w), the formula (7) with s = iz becomes 
the desired identity. Oo 


I hope this brief discussion further illustrates the wealth of identities which come 
out of formulas like (3) or (6). 


6 Integrals of Bessel functions 


Already the fact that e~'/,.(t) is the heat kernel on Z gives an alternative way of 
looking at Bessel functions; for example the addition theorem 


Jn(t + 5) = > Ink (t) Jk (s) 


k=—0o 


becomes obvious if one thinks probabilistically. 
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From one of the basic recurrence formulas for J, 
Jy-1(2) — Jui = 273 @), 
one can deduce that [1, 11.1.2] for v > —1, 
x lo.) 
i Jy (t)dt = 2 Joxav4i(X). 
9 k=0 


In the cases where v = / an integer, this latter sum can be simplified to a finite sum 
from the theta inversion formula for Z with m = 2, recalling that J_, = J,. One 
gets that 


x x /-1 
[saat = [soar - 2 drxsrto 
0 0 k=0 


l 


[fo taodt = 1 = 0) -2 0 Jarl, 
0 


k=1 


which are [1, 11.1.3] and [1, 11.1.4] respectively. 
Other examples, even more adapted to our formula, are the convolution-type 
integrals 


i Ji (t)In(x — t)dt. 
0 


Here the following formula holds [1, 11.3.37]: 
x oo 
[snl —pat = 2D arson), 
? k=0 


for integers /,n > 0. We now carry out an example of how to compute this in detail. 
First we rewrite the sum in terms of /-Bessel functions: 


x [o,@) 
/ I(t) In(x — t)dt = 20D De rangi (—ix) 
k=0 


[oe 
Tose 
S20" y Tox -41-4n41(—iXx). 
k=0 


We continue, but now assuming that / + 7 is even, and then using (5) with m = 2, 
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x i lore) (J-++-n)/2-1 
[ s@mepar= Cn (iO penein 27 YO ei(-ix) 
0 k=—00 k=0 
ie r (J+n)/2-1 
=I)? [Ze +e") — 27 DT Desi (ix) 
k=0 
ie (J-+-n)/2-1 
=(-1)? [sinx-2 DP (1) resi) 
k=0 
Similarily, if / + 1 is odd, one gets 
x jp (I+-n—1)/2 
/ I(OIn(x = t)dt = (-1)*F* [ cosx + Jo(x)-—2 YD (DE Sex) 
0 k=0 
In the special cases n = —/, orn = 1 —1 with] = 0 these formulas become (com- 


pare with [1, 11.3.38] and [1, 11.3.39] which hold also for nonintegers —1 < / < 1) 


[ Jo(t) Jo(x — t)dt = sin x, 


0 


fi Jo(t) Ji (x — t)dt = Jo(x) —cosx. 


0 


7 Personal remarks 


I had the great privilege to attend in total around ten semesters of mathematics 
courses taught by Serge Lang. Like many others I am grateful to him for his 
teaching, generosity, and constant encouragement. During the spring semester of 
2005, when I was his office neighbor at Yale — a very special and interesting 
experience in itself — we often had conversations on topics related to the present 
paper. I was struck by the sad news of his death on September 12, 2005. I miss 
Serge, in particular his great sense of humor, and I feel fortunate for having known 
him. 
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Report on the irreducibility of L-functions 


Nicholas M. Katz 


Dedicated to the memory of Serge Lang 


Abstract In this paper, in honor of the memory of Serge Lang, we apply ideas of 
Chavdarov and work of Larsen to study the Q-irreducibility, or lack thereof, of var- 
ious orthogonal L-functions, especially L-functions of elliptic curves over function 
fields in one variable over finite fields. We also discuss two other approaches to these 
questions, based on work of Matthews, Vaserstein, and Weisfeller, and on work of 
Zalesskii-Serezkin. 


Key words Q-irreducibility * L-function ¢ elliptic curves 
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1 Introduction 


By the pioneering work of Dwork [Dw-Rat] and Grothendieck [Gr-Rat], we know 
that zeta functions of varieties over finite fields, as well as L-functions attached to 
quite general algebro-geometric situations over finite fields, are rational functions. 
In many cases, either this function or its “interesting part” is a polynomial with 
Q-coefficients. In such cases, it is natural to wonder about the factorization 
properties of this Q-polynomial. This question was first investigated by Chavdarov 
[Chav, Theorems 2.1, 2.3, 2.5], who used monodromy techniques to show that for 
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a fixed genus g > 1, most genus g curves over a large finite field F, have the 
numerator of their zeta function Q-irreducible, i.e., the fraction of the genus g curves 
over IF, with this irreducibility property tends to 1 as q grows. [Strictly speaking, 
Chavdarov’s literal result requires g to be a power of a fixed prime p.] Recently 
Kowalski [Kow-LSM] combined Chavdarov’s monodromy methods with large sieve 
techniques to give quantitative refinements of Chavdarov’s results. 

It occurred to the author in the fall of 2001 that one might apply Chavdarov’s 
ideas to study the irreducibility properties of L-functions of elliptic curves EF over 
one-variable function fields K over finite fields F’,. Here one knows that, as long as 
the j-invariant is non-constant, the L-function is a polynomial with Z-coefficients, 
of known degree d, of the form 


L(T) = det(1 — gTA) 


for a (necessarily unique up to conjugacy) element A in the compact real orthogonal 
group O(d, R). The unitarized L-function, 


L,(T) := L(T/q) = det(1 — TA) 


thus has coefficients in Z[1/q]. Being the reversed characteristic polynomial of an 
element A in O(d,R), it satisfies the functional equation 


T?L,(1/T) = det(—A)L,(T). 


Here det(—A) = +1 is the “sign in the functional equation”. With this normal- 
ization, the point T = 1 is the Birch and Swinnerton-Dyer point. The Birch and 
Swinnerton-Dyer conjecture states that the Mordell—Weil rank of E/K, MWrk. 
(E/k), is equal to the multiplicity of T = 1 as a zero of L,,(T) (which has come to 
be called the “analytic rank” of E/K, an.rk.(E/k). One has (in the function field 
case) the a priori inequality 


MWrk.(E/k) < an.rk.(E/k). 


The analytic rank is odd if and only if the sign in the functional equation is —1, in 
which case the analytic rank, being odd, is at least 1. On the other hand, if the sign 
in the functional equation is +1, then the analytic rank, being even, has “no reason” 
to be nonzero. There is a general expectation that, in any reasonable enumeration 
sense, “most” elliptic curves will have the lowest possible analytic rank, i.e., 0 or 1, 
that is compatible with the sign in their functional equations. We refer the reader to 
[deJ-Ka, 9.7], [Ka-TLFM, 8.3, 9.11, 10.3] and [Ka-MMP, 13.1.7] for one approach 
to this sort of question. 

One knows that, depending on the parity of d and on the sign in the functional 
equation, either 1 or —1 or both or neither necessarily occur as “imposed” 
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eigenvalues of an element A in O(d,R). More precisely, for d odd, —det(—A) is 
always an eigenvalue of A. For d even and det(—A) = —1, both +1 are always 
eigenvalues of A. So it is natural to introduce the “reduced” polynomial 


Rdet(1 — TA) := det(1 — TA)/(1—T); d odd, sign — 1, 
Rdet(1 — TA) := det(1 — TA)/(1 + T); d odd, sign + 1, 
Rdet(1 — TA) := det(1 — TA)/(1 — T”); d even, sign — 1, 
Rdet(1 — TA) := det(1 — TA); d even, sign + 1, 


and the reduced (unitarized) L-function 
Lured(T) := Rdet(1 — TA) 


We propose to show that in various settings, “most” elliptic curves have their 
reduced L-functions Q-irreducible. The relevance to the Birch and Swinnerton- 
Dyer Conjecture is simply this: so long as the reduced L-function has degree 
>2, if it is Q-irreducible, then it cannot have T =1 as a root, and hence its 
analytic rank is as low as possible. This consequence for analytic rank gives nothing 
better than the already cited results [deJ-Ka, 9.7], [Ka-TLFM, 8.3, 9.11, 10.3] 
and [Ka-MMP, 13.1.7], the only interest is in the methods. [Work of Emmanuel 
Kowalski [Kow-RQT], Chris Hall [Ha], and Florent Jouve [Jo], using related 
ideas together with large sieve technology, allows one to do better.] It would be 
interesting to understand what is the analogue, if any, in the number field case, of 
the irreducibility of the reduced L-function. 

To end this introduction, let us mention briefly a natural question that we do 
not discuss at all; given that “most” elliptic curves have their reduced L-functions 
Q-irreducible, what are the galois groups (of the splitting fields, over Q, of) 
the Q-irreducible polynomials which arise? A natural guess is that for d odd, 
say d =2n + 1, we should “usually” get the Weyl group of the root system By, 
independent of the sign in the functional equation. For d even and sign +1, say 
d =2n, we should “usually” get the Weyl group of the root system D,,. But for 
d even and sign —1, say d = 2n + 2, we should “usually” get! the Weyl group of 
the root system C;,. The analogous question for families of curves of genus g, where 
we have symplectic monodromy, was posed and answered by Chavdarov [Chav] and 
made more quantitative by Kowalski [Kow-LSM]; here the galois group is “usually” 
the Weyl group of the root system Cy. 


'The reason we expect this Weyl group is the fact [Weyl, (9.15) on p. 226] that in the compact 
orthogonal group O(2n + 2,R), the space of conjugacy classes of sign (here sign = determinant) 
—1 is, with its “Hermann Weyl measure” of total mass one, isomorphic to the space of conjugacy 
classes in the compact symplectic group USp(2n), with its “Hermann Weyl measure” of total 
mass one. 
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These results were worked out in the author’s Princeton graduate courses of 
Fall, 2001 and of 2004—2005, and were presented in lectures at the University of 
Minnesota (2001), NYU (2001), the Newton Institute (2004), the University of 
Tokyo (2004), and Brown University (2005). It is a pleasure to thank the listeners 
for their stimulating questions. 


2 The general setup, and the basic examples 


We work over an integral domain R which is normal, finitely generated as a 
Z-algebra, and whose fraction field has characteristic zero. Typically, R will simply 
be Z[1/N] for some integer N > 1. Over R, we are given a smooth R-scheme M/R 
of relative dimension v > 1 with geometrically connected fibres. Over M, we are 
given a proper smooth curve C/M and a closed subscheme D C C which is finite 
etale over M. We denote by U/M the open curve 


U :=C-D. 


Finally, over U we are given a relative elliptic curve E/U. 

Before going further, let us give the two basic examples we have in mind. 

The first example is the universal family of good degree d polynomial twists of 
the Legendre curve. Here R is Z[1/2]. We fix an integer d > 3, and take for M 
the open set Twisty in the affine space Ad of all monic, degree d polynomials 
in one variable A consisting of those polynomials f(A) for which the product 
f(O) f()Discrim(f) is invertible. Over this Twistg we have the universal such 
polynomial, finiv, and we have the constant curve P!/Twistz, with coordinate A, 
in which we take for D the disjoint union of the sections 00, 0, 1 and the zero locus 
of finiv. So D is finite etale over Twisty of degree d + 3. Here we have 

U = Atwigy[1/A(A — 1) funiv(A)]. 
Over this U, we take for E/U the twisted Legendre curve in Pe whose affine 
equation is 


y? _ Taniv (A) x (x = 1)(x —X). 


For each finite field k of odd characteristic, and for each k-valued point f in 
Twist,(k), we obtain a relative elliptic curve Ex, over the punctured A-line 
A,[1/A(—1) f(A)], namely the twisted Legendre curve y* = f(A)x(x—1)(x—A). 
Its L-function is a polynomial of degree 2d if d is even, and of degree 2d — 1 if 
d is odd. We will show that as #k grows, the fraction of twisting polynomials f 
in Twistg(k) for which the reduced L-function of the twisted Legendre curve is Q- 
irreducible tends to 1. On the other hand, we have at present no means of addressing 
the following extremely natural question. Fix a finite field k of odd characteristic, 
and consider, as the integer d grows, the fraction of twisting polynomials f in 
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Twistg(k) for which the reduced L-function of the twisted Legendre curve is Q- 
irreducible. Does this fraction tend to 1 as d grows but k stays fixed? To some other 
nonzero limit (cf. [Poonen] for an analogous situation)? To any limit? 

The second example is the universal family of good Weierstrass curves with 
g2 and g3 of at most specified degrees dz and d3 respectively. Here R is Z[1/6]. 
We fix integers d) > 3 and d3 > 3, and we suppose that either d, > 5 or that 
d3 > 7. We take for M the open set W(d2, d3) in the affine space Are x A? 
consisting of those pairs of polynomials (g2(t), g3(t)) of degrees at most (do, d3), 
for which the auxiliary polynomial A(g>, ¢3) := g2(t)* — 27g3(t)* has degree 
exactly Max(3d2,2d3) and has its discriminant invertible. Over W(d2,d3) we 
have the universal pair (29 yniv(t), £3.univ(t)), the constant curve P!/ W(ds, d3) with 
coordinate ¢, and the divisor D which is the disjoint union of the section oo and the 
zero locus of A(22,univ(t), £3.univ(t)). So D is finite étale over W(d2, d3) of degree 
1 + Max(3d2, 2d3). Here we have 


U= AW(é;.ayl1/ A(82,univ(t), 83univ(t))]. 


Over this U, we take for E/U the relative elliptic curve given in Pe whose affine 
equation is the universal Weierstrass equation 


a = 4x3 — 22.uniy (t)xX — 83,univ(t). 


For each finite field k in which 6 is invertible, and for each k-valued point 
(go(t), g3(t)) in W(d2, d3)(k), we obtain the relative elliptic curve Ex.¢,,¢, over 
the punctured t-line Aj[1/A(g2, g3)], namely the Weierstrass curve y? = 4x? — 
go(t)x — g3(t). Its L-function is a polynomial of degree Max(3d2,2d3) — 2 if 
12 divides Max(3d>, 2d3), otherwise of degree Max(3d>,2d3) — 4. We will show 
that as #k grows, the fraction of points (g2(t), g3(¢)) in W(d2, d3)(k) for which 
the reduced L-function of the corresponding Weierstrass curve is Q-irreducible 
tends to 1. Just as in the first example, if we fix a finite field k in which 6 is 
invertible, and vary the integers (dz, d3) in such a way that, say, Min(d2, d3) grows, 
we have no understanding of the limiting behavior, if any, of the fraction of points 
in W(d2, d3)(k) whose reduced L-function is Q-irreducible. 


3 Back to the general setup; axiomatics 


We return to the general setup. Thus R is an integral domain which is normal, 
finitely generated as a Z-algebra, and whose fraction field has characteristic zero, 
and M/R is smooth of relative dimension v > 1 with geometrically connected 
fibres. Over M, we are given a proper smooth curve C/M and a closed subscheme 
D CC which is finite étale over M. U/M is the open curve 


U:=C-—D, 
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and over U we are given a relative elliptic curve E/U. So our picture is 
E>UCC->M = Spec(R). 
Let us name these morphisms, say 


f:E-U, 
J:UCC, 
m:C—-M. 


If k is a finite field and m € M(k) is a k-valued point of M, then by base 
change we obtain from E/U/M an open curve U;.»,/k and a relative elliptic curve 
Ex.m/Ukm/k. Let us recall the cohomological genesis of its unitarized L-function. 

For a prime number £, and A any of the rings Fe, Ze, Q¢ or Q, consider the lisse 
sheaf on U[1/£] given by 

Fa:= R'f,A. 


It is a sheaf of free A-modules of rank 2, whose determinant is canonically the 
Tate-twisted constant sheaf A(—1). So we have a canonical symplectic autoduality 
paring 

Fa x Fa = A(-1). 


Because R and hence M are normal and connected of generic characteristic zero, 
any lisse A-sheaf on U[1/£] (here F4) is tamely ramified along the finite étale 
divisor D[1/£]. We next consider its extension by direct image, 


Ga = jrFa, 
on C [1/€]. The autoduality pairing on F4 extends by direct image to a pairing 
Ga xGa —> jx A(-]) = A(-1). 


The formation of G4 on C[1/£] commutes with arbitrary base change on M[1/€], 
and its restriction to D[1/£] is a lisse sheaf of free A-modules on D[1/£]. We then 
form the Tate-twisted higher direct image sheaf 


Ha := R'n.Ga(1) 
on M[1/€]. This is a lisse sheaf of (not necessarily free, when A is Ze) A-modules 
of finite type. Its formation commutes with arbitrary base change on M [1/€]. It is 


endowed with an A-linear cup product pairing 


Ha XH, > R’n,A(1) & A. 
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When A is a field, this pairing makes 14 orthogonally self-dual. When A is Q, or 
Qe, then Hy is, in addition, pure of weight zero. We view the lisse sheaf H,4 as a 
representation of 7, (M [1/£]). 


Theorem 3.1 Jn the general setup E/U/M/R as above, there exist integers d > 0 
and N > | such that for € not dividing N, Hz, is a lisse sheaf of free Zp-modules 
of rank d on M[1/€] which, by the cup product pairing 


Hx, x Hz, => Zu, 


is orthogonally self-dual over Ze. 


Proof. Pick an embedding of R into C, and make the extension of scalars from R 
to C. We denote by the superscript an the corresponding analytic objects. Thus we 
have the locally constant sheaf H.7" of finitely generated abelian groups on M™, 
endowed with the cup product pairing to Z". If we tensor it with Q, we obtain 
the locally constant sheaf H' on M™", which by cup product is orthogonally self- 
dual. We take for d the rank of H2. If we invert a suitable integer N > 1, and 
tensor 7" with Z[1/N] to obtain (by the flatness of Z[1/N] over Z) Hz.) 
we find that H7/,,,\ is a locally constant sheaf of free Z{1/N]-modules of rank 
d which under cup product is orthogonally self-dual over Z[1/N]. We can take this 
N to be the N of the theorem. Indeed, for any ¢ not dividing NV, we can make 
the flat extension of scalars from Z[1/N] to Ze and infer that Hz) is a lisse sheaf 
of free Ze-modules of rank d on M*" which is orthogonally self-dual over Ze. By 
the comparison theorem, the restriction to Mc of Hz, is therefore a lisse sheaf of 
free Ze-modules on Me which is orthogonally self dual over Z,. It follows that 
the lisse sheaf Hz, on M[1/€] itself is torsion-free and Z;-autodual under the cup 
product pairing. Indeed, it suffices to check both the torsion-freeness of the lisse 
sheaf in question, namely Hz,, and the Ze-nondegeneracy of the pairing, at a single 
geometric point of M [1/£]. 


We now consider two fibrewise conditions that may or may not hold in our 
general setup. Both of these conditions do hold in both of the examples given 
above (Legendre twists and Weierstrass families), cf. [Ka-MMP, 8.2.3 and 10.2.13] 
respectively for these two cases. 


(1) For every finite field k, and for every k-valued point m in M(k), the relative 
elliptic curve Ex. /Ux.m/k has non-constant j-invariant. 

(2strong) For every finite field k and every ring homomorphism ¢ : R > k, denote 
by Mi.4/k the fibre of M/R above (k, ¢). For every £ invertible in k, consider 
the restriction to M; 4 of the lisse sheaf Hg, on M [1/£]. View this lisse sheaf as a 
representation px.¢,c : 71(Mxg) > O(d, Qe). Under every such homomorphism 
Pk.o,t, the image in O(d, Qr) of the geometric fundamental group 


1" (Mi 6) = (Mig Wr k) <q (Mig) 


is Zariski dense in O(d,Q,). 


328 N.M. Katz 


In certain applications, cf. [Ka-MMP, 7.2.7, 8.2.5, 10.2.15] and [Ka-TLFM, 
8.5.7, 8.6.7], one knows only that the following weaker version of the second 
condition holds. 


(2weak) For every finite field k and every ring homomorphism ¢ : R > k, denote 
by Mi.4/k the fibre of M/R above (k, ¢). For every £ invertible in k, consider 
the restriction to My 4 of the lisse sheaf Hg, on M [1/£]. View this lisse sheaf as a 
representation px,¢,¢ : 71(Mxg) — O(d, Qe). Under each such homomorphisms 
Pk.o,t, the image in O(d, Qr) of the geometric fundamental group 


1" (Mi 6) = (Mig Wr k) <q m1 (My) 


is Zariski dense in either SO(d, Q,) or in O(d,Q)). 


If the first condition holds, then for every ¢ invertible in k, the lisse sheaf Fg, on 
Uk.m/k is geometrically irreducible, and (hence) the unitarized L-function is given 
by the action of the Frobenius conjugacy class Frob; », in 1;(M[1/€]) on the lisse 
sheaf Hg,: 

Lu(Exm/ Uk m: T)= det(1 _ TFrobx. m|Ho,)- 


[If we do not impose the first condition, the lisse sheaf Fg, on Ux m/k could 
be geometrically constant (e.g., if E/U were a constant elliptic curve), in which 
case the unitarized L-function would not be a polynomial, but rather a rational 
function whose numerator is given by the right hand side.] Since these unitarized 
L-function have rational coefficients which “do not know about ¢’”, we see that the 
sheaves Hg, on M [1/€] form, as ¢ varies, a “compatible system of orthogonal ¢-adic 
representations” on M. Moreover, and this is the import of the previous theorem, 
there exists a single orthogonal group O(d)/Z[1/N], corresponding to a quadratic 
form over Z[1/N] in d variables whose discriminant is invertible in Z[1/N], such 
that for every € we land in its Qy-points, and such that for € prime to NV, we land 
in its Zp points. What is essential here is“only” the following (apparently weak) 
consequence of this last fact: for almost all € (namely those ¢ prime to NV), we are 
landing in the Zz points of an orthogonal group over Z, corresponding to a quadratic 
form over Z, in d variables whose discriminant is invertible in Z,. 


For each finite field k and each homomorphism ¢ : R — k, denote by 
IrrFrac(k, @) € Q the fraction of the k-valued points m in the fibre My.g/k for 
which the reduced unitarized L-function Lyrea(Exm/Uk.m, T) is Q-irreducible. 


4 Statement of the main theorem 


Theorem 4.1 In the general setup E/U/M/R, suppose that the fibrewise condi- 
tions (1) and (2weak) of the previous section hold. Suppose also that d, the common 
degree of the L-functions, is = 3. Given a real number € > 0, there exists a real 
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constant X = X(e,E/U/M/R) such that for any finite field k with #k > X, and 
any homomorphism @ : R — k, we have 


IrrFrac(k, @) > 1—e. 


5 Statement of an abstract version of the main theorem 


Let us now consider an abstract version of our situation. We are given a finitely 
generated Z-algebra R. Over R, we are given a smooth R-scheme M/R of relative 
dimension v > 1 with geometrically connected fibres. We are given an integer 
d > 3. For every prime @ such that M[1/€] is nonempty, we are given a lisse 
Qy-sheaf Hg, on M[1/£] of rank d, together with a symmetric autoduality pairing 


Ha, x Hq > Q. 


These sheaves are assumed to form a compatible system of £-adic representations 
on M (in the sense that each characteristic polynomial of Frobenius has rational 
coefficients which are independent of the auxiliary choice of allowed £). Each sheaf 
Hg, is assumed pure of weight zero. For all but finitely many @, say for all £ outside 
a finite set S of primes, we are given a lisse Zy-sheaf Hz, on M[1/€] of free Z, 
modules of rank d, together with a symmetric autoduality pairing over Z,, 


Hz, x Hz, > Ze, 


which is an integral form of Hg, with its autoduality pairing. 

For each finite field k and each homomorphism ¢ : R — k, denote by 
IrrFrac(k,@) € Q the fraction of the k-valued points m in the fibre My.g/k for 
which the reduced characteristic polynomial Rdet(1 —TFrob, »,|H.) is Q-irreducible. 


Theorem 5.1 In the abstract version given above, with d > 3, suppose that the 
fibrewise condition (2weak) of the previous section holds. Given a real number 
€ > 0, there exists a real constant X = X(e, R) such that for any finite field k 
with #k > X, and any homomorphism $ : R — k, we have 


IrrFrac(k, @) > 1l—e. 


We will fix ¢€>0, and prove the theorem for this value of €. We reduce 
immediately to the case when R is reduced. If we have a finite decomposition 
of Spec(R) as the disjoint union of finitely many locally closed, reduced affine 
subschemes Spec(R;), it suffices to prove the theorem (for our fixed € > 0), over 
each Spec(R;) separately. Indeed, then we can take X(e, R) to be Max; (X(e, R;)). 
So by noetherian induction on Spec(R), it suffices to prove that the theorem holds, 
for our fixed € > 0, in some affine open neighborhood of some maximal point of 
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Spec(R). Any sufficiently small such open neighborhood is of the form Spec(R}), 
with R; a normal integral domain which is a finitely generated Z-algebra. Making 
the extension of scalars from R to R;, we are reduced to proving the following 
“generic” version of the theorem. 


Theorem 5.2 In the abstract version given above, with d > 3, suppose that the 
fibrewise condition (2weak) of the previous section holds. Suppose in addition that 
R is anormal integral domain which is a finitely generated Z-algebra. Given a real 
number € > 0, there exists a real constant X = X(e, R) and a nonzero element 
r =r(e) € R, such that for any finite field k with #k > X, and any homomorphism 
o@:R—k for which o(r) 4 0, we have 


IrrFrac(k, @) > 1l—e. 


6 Interlude: Review of orthogonal groups over finite fields 
of odd characteristic 


In this section, we fix an integer d > 1, a finite field E = F, of odd characteristic, 
and a nondegenerate quadratic form in d variables over E, i.e., a d-dimensional E 
vector space V endowed with a symmetric E-bilinear form V : V x V — E which 
makes V autodual. We denote by O(V, VY) := Autg(V, V) the corresponding finite 
orthogonal group. 


One knows that for fixed d and E, there are precisely two isomorphism classes 
of nondegenerate quadratic form, distinguished by whether or not the discriminant 
is a square in E*. When d is odd, the two isomorphism classes give rise to the same 
orthogonal group; indeed if (V, WV) represents one class, then for any nonsquare 
a € E%*, (V,aW) represents the other, while visibly their orthogonal groups 
coincide. So we may speak unambiguously of the group O(d, E) when d is odd. 


When d = 2n is even, then the two cases are called the split case and the 
nonsplit case. The standard model for the split case is given by the quadratic 
form )~"_, X;Xn4; (so here (—1)”Discriminant is a square), which we will denote 
(split, , std). A convenient model for the nonsplit case is to take V := F,2 as our 
IF, vector space, endowed with the symmetric bilinear form 


W(x, y) = (1/2)Traceg,., 2, (x9""), 
and quadratic form 
W(x, x) = Tracer, /r, (Norme , [Fyn (X)). 


For ease of later reference, we will refer to this model as the standard nonsplit 
model, and denote it (IFj2., std). The split and nonsplit orthogonal groups are not 
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isomorphic; they even have different orders. We will denote them O,p1(d, E) and 
Ononsp\(d, E) respectively when we need to distinguish them. 

On the Clifford algebra C] := CI(V,W) attached to (V,W), we have the 
E-algebra involution J whichis v +> —von V, and the E-algebra anti-automorphism 
x + t(x) which is the identity on V. We have its unit group C/*. The unit 
group acts on the Clifford algebra by the sign-twisted conjugation action: u € C1” 
acts as x +> I(u)xu7!. Inside C1* we have the (twisted) Clifford group, namely 
the subgroup C* consisting of those elements which map V to itself. Every 
nonisotropic v € V lies in C*, for the map x t> I(v)xv7! is then, for x € V, 
reflection in v. Moreover, one knows that every element of C* is a nonzero scalar 
times a (possibly empty) product of nonisotropic vectors v € V; this corresponds 
to the fact that in the orthogonal group O(V, WY), every element is a product of 
reflections in nonisotropic vectors. For u € C%, its “norm” N(u) := t(u)u lies in 
E*, and x t+ N(x) is a group homomorphism. Its kernel is the group Pin(V, W): 


Pin(V, Y) := Ker(N : C* > E%). 


The subgroup of Pin(V, Y) consisting of the elements fixed by the involution J is 
the group Spin(V, W). 


Remarks 6.1 The reader should be warned of a possible source of serious confu- 
sion. In the older literature, e.g., [Artin-GA], [Bour-AlgIX] and [Chev-Spin], the 
unit group is made to act on the Clifford algebra by the literal conjugation action: 
u € C1* acts as x +> uxu!, and one takes the (untwisted) Clifford group, denoted 
T in [Chev-Spin, 2.3], accordingly. This leads to unpleasant difficulties, centered on 
the fact that when V is odd-dimensional, there are nonscalar elements of I’ which act 
trivially on V, and the “norm” of an element of I’ need not be a scalar. Contorsions 
are adopted to get around these difficulties; one obtains the group Spin(V, WY), but 
there is no Pin(V, WY) in the older theory. The sign-twisted approach, and the group 
Pin, first appeared in [AtBS-Clif, 1.7, 3.1], cf. also [Kar-Clif, 1.1.4-8]. 


We have an exact sequence 
{1} => +1 > Pin(V, V) > O(V, YW) > +1, 
in which the last map is the spinor norm, denoted 
sp: O(V, WV) > +1. 
The spinor norm is determined by its value on reflections Rf/, in nonisotropic 
vectors v € V (since these elements generate O(V,W)). For these, we have the 


explicit formula 


sp(Rf1,) = the class of W(v, v) in EX /(E*)? & +1. 
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If d > 2, the spinor norm is surjective, and we have a short exact sequence 
{1} > +1 > Pin(V, V) > O(V, VW) > +1 > {1}, 


under which the inverse image of SO(V, WV) in Pin(V, W) is Spin(V, WY). So we also 
have the more standard short exact sequence 


{1} > +1 > Spin(V, VY) > SO(V, Y) > +1 > {1}. 
We also have the determinant homomorphism 
det: O(V, V) > +1. 


The simultaneous kernel of these two homomorphisms, sp and det, is denoted 
QV, W). 

When d > 5, or when d = 4 and we are in the nonsplit case, or when 
d = 3 and the characteristic is > 5, the group Q(V, WV) is, modulo its center, a 
nonabelian simple group, cf. [Artin-GA, Theorems 4.9, 5.20, 5.21, 5.27]. Moreover, 
in these cases, the only proper normal subgroups of Q(V, VY) are subgroups of its 
center, and consequently Q(V, V) is its own commutator subgroup. The center of 
Q(V, W) is trivial if either d is odd or if the discriminant is a nonsquare, otherwise 
it is +1. When d = 4 and the characteristic is >5 and we are in the split case, 
then Q(V,W)/ + 1 is the product PSL(2, E) x PSL(2, E) of the simple group 
PSL(2, E) with itself, cf. [Artin-GA, Theorem 5.22], and Q(V,W) is its own 
commutator subgroup [being a quotient of Spin(V,V) = SL(2,E) x SL(2,E), 
which is its own commutator subgroup]. 

One knows [Artin-GA, Theorems 5.14, 5.17] that Q(V, VY) is the commutator 
subgroup of O(V, W); indeed this was its definition before Chevalley introduced the 
use of Clifford algebras in these questions, cf. [Die-GC, Chpt. III, Section 12, p.23]. 
For d > 2, the quotient group O(V, V)/Q(V, W) is, by the pair of maps (det, sp) 
the group {+1} x {£1}. We will need to know, in each of the four cosets of Q(V, WV) 
in O(V, WY), lower bounds for the numbers of elements A whose reduced reversed 
characteristic polynomials Rdet(1 — TA) have, as E-polynomials, certain imposed 
factorization patterns. For each (a, 6) € {+1} x {£1}, we denote by 


O(V, V)(det = a, sp = B) 


the corresponding coset. 

There is a further cautionary remark we need to make at this point. Suppose 
d > 2; we are given a subgroup H of GL(V) := Autg(V), and we are told that 
H = O(V,W) for some symmetric autoduality Y. Then the subgroup Q(V, W) is an 
intrinsic subgroup of H, namely its commutator subgroup. The det homomorphism 


det: H > +1 
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is intrinsic on H as a subgroup of GL(V). However, the spinor norm homomor- 
phism 
sp: H > +1 


depends on the choice of W. Indeed, if we replace V by a nonzero scalar multiple 
aW with a a nonsquare, the orthogonal group does not change, but the two spinor 
norms are related by 

SP(vaw) (2) = det(h)spyyy(h). 


On the other hand, since the quotient group H/Q(V, W) is of type (2,2), with det 
and spyy,y) an F-basis of its character group, we see that for any Y; on V with 
O(V,V) = O(V, V1), we have either spyyy,)(2) = spry,y)(A) for every h € H, 
or we have spyy.y,)(2) = det(h)spiy.y)(2) for every h € H. Thus each of the two 
cosets of Q(V, W) in HN SL(V) = SO(V, WY) is intrinsic, e.g., one is a subgroup 
and one isn’t, but the two cosets of Q(V, VW) in H \ HN SL(V) = O(V,W) \ 
SO(V, VY) may be interchanged by different choices of W. In the discussion below, 
we work with particular models of our orthogonal groups, i.e., we make specific 
choices of WY. But we prove only statements which are invariant under replacing sp 
by det x sp. 


Lemma 6.2 Fix d = 2n > 2. Suppose q := #E > 7. In each of the two cosets 
of Q(d, E) in SOnonsp\(d, E), the fraction of elements A for which Rdet(1 — TA) is 
E-irreducible is at least 1/2n. 


Proof. In the standard nonsplit model (F,2n, std), the group (iggn >= Mitg:(F,2"), 
acting by homothety on Fj2n , lies in SOponspi (27, E'). Moreover, we know [Saito-sign, 
Lemma 1, parts 4 and 5] that the spinor norm, restricted to 4149", is trivial 
precisely on the subgroup f4(149")/2 Of squares. We also remark that every 
F,2»-homothety which lies in the orthogonal group lies in 414 ". It follows that if 
€ € [14g is an element such that the field F,(¢) is F,2., then its characteristic 
polynomial is an F,-irreducible palindromic polynomial, and its centralizer in 
Ononspi(2n, E) is the subgroup [414 " [simply because any F,-linear endomorphism 
A of Fj2 which commutes with ¢ is F,2.-linear, so an F,2.-homothety]. So for 
any such ¢, its conjugacy class in Oponspi(2, E) contains #Oponspi(2n, E)/(1 +g") 
elements, all of which have the same F,-irreducible palindromic characteristic 
polynomial as ¢, as well as the same spinor norm and determinant as ¢. If we 
take a second such element ¢; which is not one of the 2” Galois conjugates 
of ¢, then its characteristic polynomial is a different F,-irreducible palindromic 
polynomial, so certainly its conjugacy class in Oponspi(2n, E) is disjoint from that 
of ¢. [Conversely, Galois conjugate elements of 4144” are Ononspi(2n, E)-conjugate, 
since the Galois automorphisms of Fj2/F, lie in the orthogonal group, and their 
conjugation action on elements of j1;4,” is the same as their Galois action.] Denote 
temporarily by N4 the number of elements € € f4j4," of spinor norm + 1 such 
that the field F,(¢) is Fj2.. Taking the union of their conjugacy classes, we obtain 
#Ononsp\(2n, E)N4/2n(1 + qg”) elements in Oponspi(2n, E)(det = 1,sp = +1) 
with an E-irreducible palindromic characteristic polynomial. 
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One sees easily if € € f4149n is such that F,(¢) is a proper subfield of of F,2, 
then either € = +1, or F,(¢) is Fj20 for some divisor a < n of n, € € pli¢ge and 
n/a is odd. Thus, denoting [x] := Floor(x), at most 2 + [n/3]q'/*) of the elements 
in /414q" fail to generate F,2, over F,. So we have the estimates 


Na > (1+q")/2-—2-[n/3]g". 


Treating separately the cases [n /3] = 0 and [n/3] > 1, we see that so long as g > 7, 
we have 
Nz = (1+ q")/4. 


Thus we obtain at least 
#Ononsp\(2n, E)/8n = #Oponspi(2n, E)(det = 1,sp = £1)/2n 


elements in Oponspi(2n, E)(det = 1,sp = £1) with an E-irreducible palindromic 
characteristic polynomial. 


Lemma 6.3 Fix d = 2n > 4. Suppose q := #E > 7. In each of the two cosets 
O(d, E)(det = —1, sp = +1), the fraction of elements A for which Rdet(1 — TA) 
is E-irreducible is at least 1/4(2n — 2). 


Proof. We take as model of our quadratic space 
(Fg2n-2 ; std) @ (F,2 ; std) 


in the split case, and 
(Fg2n—2, std) ® (split, std) 


in the nonsplit case. Corresponding to these direct sum decompositions, we have 
inclusions of the corresponding orthogonal groups 


O(2n —2,E) x O(2, FE) C O(2n, E). 


In the orthogonal group of the first factor, take an element € € (y49.—1 which 
generates Fj2n—2 over Fy. In the orthogonal group of the second factor, take a 
reflection R of spinor norm one, e.g., take the reflection in a vector of square 
length one. The centralizer in O(2n, E) of the element (¢, R) is the product group 
My4gr-! X {£1, +R}. [Indeed, if an element A in an orthogonal group over a field 
of characteristic not 2 has a (reversed or not, the two agree up to sign) characteristic 
polynomial which is a product |]; fi(T) of pairwise prime polynomials, each of 
which has its roots stable by x t+ 1/x, then the decomposition of the ambient 
vector space V as the direct sum of the spaces V; := Ker(f;(A)) is an orthogonal 
decomposition. Any endomorphism B of V which commutes with A preserves this 
decomposition, say B = @;(B; on V;), and on each V;, B; commutes with A|V;. 
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Moreover, if B is orthogonal, then so is each B;.] The counting argument used to 
prove the lemma above then gives the asserted result. oO 


Lemma 6.4 Fix d = 2n+1 > 3. Suppose q := #E > 7. In each of the four cosets 
O(d, E)(det = +1, sp = +1), the fraction of elements A for which Rdet(1 — TA) 
is E'-irreducible is at least 1/4n. 


Proof. We take as model of our quadratic space 
(F,2, std) ® (Fy, x”), 


and repeat the previous argument, now using elements of the form (¢, +1). Oo 


Lemma 6.5 Fix d = 2n > 6. Suppose q := #E > 7. Fix a partition of n as 
n=a+bwith1 <a < b. Ineach of the two cosets of Q(d, E) in SOs (d, E), 
the fraction of elements A for which Rdet(1 — TA) is of the form 


(E — irreducible of degree 2a)(E — irreducible of degree 2b) 


is at least 1/32ab. 


Proof. We take as model of our quadratic space 
(F420 , std) @ (F 2 std), 


and repeat the previous argument, now using elements of the form (Ca € [i+g2, 5 € 
Hi+49>)- If &a (respectively ¢,) has full degree 2a (respectively 2) over Fy, the 
centralizer of this element in O(2n, E) is the product group /41+92 X /414,45, and the 
argument concludes as before. oO 


Lemma 6.6 Fix d = 2n > 4. Suppose q := #E > 7. In each of the two cosets of 
Q(d, E) in SOs (d, E), the fraction of elements A for which Rdet(1 — TA) is of the 


form 
(E — irreducible P(T) of degree n)(E — irreducible O(T) of degree n), 
with P and Q relatively prime, and with 
Q(T) = (some constant in E*)T” P(1/T), 


is at least 1/2n. 


Proof. We take as model 
V = Fyn @ Fy, 


with the split quadratic form 


W(x By, x @ y) = Tracepyn sp, (xy). 
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The group Fj, is then a subgroup of SO(V, WV), with ¢ € Fj, acting as 


o'):xO@yRbxec ly. 


By [Saito-sign, Lemma 1.1, part 2], the spinor norm, restricted to this Fin subgroup, 
is trivial precisely on the squares. Take an element (€, ~!) such that ¢ has full degree 
n over F,, and such that ¢ and ¢~' have different irreducible polynomials over F, 
(i.e., such that € and ¢~! are not Galois conjugate). Then the centralizer of (¢, ~') 
in O(V,W) is precisely the subgroup F7,. Moreover, knowing the characteristic 
polynomial of (¢,¢~!) determines ¢ up to replacing it by either one of its n 
conjugates or by one of the n conjugates of ¢~!. 

The ¢’s which fail the first condition are those which lie in a proper subfield Fa 
for some divisor a < n of n. Those which fail the second condition are those which 
lie in some subgroup [414 ,«, with 2a|n. For qg > 7, a routine counting shows that 
the number of ¢’s which fail one or both of the two conditions is at most (q” — 1)/4. 
The argument now concludes as before. 


With these preliminary lemmas established, we get the following product 
theorems. 


Theorem 6.7 Fix an odd integer d = 2n + 1 > 3, an integerr > 1, anda list of r 
primes 
7T<2, <l,<+:-<,. 


Denote by G the subgroup of the product group ||; O(d,F¢,) consisting of those 
elements (A,,...,A,;) all of whose determinants, viewed in +1, coincide. In 
any coset of [[; Q(d,F¢,) in G, the fraction of elements (A,,...,A,) such that 
Rdet(1 — TA;) is F¢,-irreducible for some i is at least 


b= = 1/4ny. 


Proof. The point is that the quotient G/[]; Q(d,Fe,) is naturally the prod- 
uct of r + 1 copies of +1, by means of the common value of the determi- 
nant and the spinor norms of the factors. So any coset is a product, either 
of cosets O(d,Fe,)(det = 1,sp = aj), or of cosets O(d,Fy,)(det = —1, 
sp = a;). The assertion is now immediate from Lemma 6.4. 


Theorem 6.8 Fix an even integer d = 2n > 4, an integerr > 1, and a list of r 
primes 
T<bi <b << €,. 


Denote by G the subgroup of the product group []; Ononspi(d, Fe,) consisting of 
those elements (A,,..., A,) all of whose determinants, viewed in +1, coincide. In 
any coset of [[; Q(d,F¢,) in G, the fraction of elements (A,,...,A,) such that 
Rdet(1 — TA;) is F¢,-irrreducible for some i is at least 


1—(1-1/8n)’. 
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Proof. By the product structure of the coset, the assertion is immediate from 
Lemmas 6.2 and 6.3. 


Theorem 6.9 Fix an even integer d = 2n > 4, an integerr > 1, and a list of r 
primes 
T<bhi <b <--- <,. 


Denote by G the subgroup of the product group ||; O(d,F¢,) consisting of those 
elements (A,,...,A,) all of whose determinants, viewed in +1, coincide; the factor 
groups may be separately split or nonsplit at will. In any coset of []; 2(d,F¢,) in 
G for which the common value of the determinant is —1, the fraction of elements 
(Aj,...,A,) such that Rdet(1 — TA;) is F¢,-irrreducible for some i is at least 


1—(1—1/8n)’. 


Proof. By the product structure of the coset, the assertion is immediate from 
Lemma 6.3. 


Theorem 6.10 Fix an even integer d = 2n > 6, an integerr > 1, anda list of r 
primes 
T<li <n <-++- <8. 


Choose a partition of n, sayn = a+b with 1 <a <b. Denote by G the subgroup 
of the product group |]; Ospi(d, F¢,) consisting of those elements (A,,...,A,) all 
of whose determinants, viewed in +1, coincide. In any coset of aE Q(d,F¢,) in 
G for which the common value of the determinant is +1, the fraction of elements 
(A,,...,A,) such that Rdet(1 — TA;) is of the form 
(E — irreducible of degree 2a)(E — irreducible of degree 2b) 

for some i, AND such that Rdet(1 — TA;) is of the form 

(E — irreducible of degree n)(a different E — irreducible of degree n) 
for some j, is at least 


L=(1 =1/a3aby = =17ony. 


Proof. By the product structure of the coset, the assertion is immediate from 
Lemmas 6.5 and 6.6. oO 


Theorem 6.11 Fix d = 4, an integer r > 1, anda list of r primes 


T<bl, <b. <+::<,. 
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Denote by G the subgroup of the product group ||; Ospi(4, Fe, ) consisting of those 
elements (A,,...,A,) all of whose determinants, viewed in +1, coincide. In any 
coset of |]; 82(4, Fe,) in G for which the common value of the determinant is +1, 
the fraction of elements (A,,..., Ay) such that Rdet(1 — TA;) is, for some i, of the 
form 


P(T)Q(T) 


with P(T) and Q(T) relatively prime F ¢, -irreducibles of degree 2, neither of which 
is palindromic, and such that 


Q(T) = (some constant in EX)T* P(1/T), 


is at least 


1—(i— 14)’, 


Proof. If we omitted the requirement that neither P(7) nor Q(T) be palindromic, 
the assertion would be immediate from the product structure of the coset, and 
Lemma 6.6. But the nonpalindromicity is automatic. Indeed, the fact that 


Q(T) = (some constant in EX)T” P(1/T), 


tells us that if € € Fy is a root of P(T), then 1/¢ is a root of O(T), hence 


cannot be a root of P(T), since P(T) and Q(T) are relatively prime. But the two 
roots of a palindromic polynomial of degree two are reciprocals. Thus P(T) is not 
palindromic, and similarly for Q(T). Oo 


7 Proof of Theorem 5.2, via a theorem of Larsen 


Let us put ourselves in the situation which Theorem 5.2 purports to treat. Choose 
a finite field k and a ring homomorphism ¢ : R — k (for instance, take a 
maximal ideal Z of R, take k to be R/Z, and take ¢ to be canonical map of R 
onto R/Z). Making the extension of scalars ¢ : R — k, we get the space Mx. 
On Mig, we have the restrictions of the sheaves Hq,, for all £ invertible in k, 
as well as of the restrictions of the sheaves Hz,, for all such £ not in the finite 
set S. For each ¢ invertible in k, let us denote by Ty the image in O(d,Q,) of 
the arithmetic fundamental group | (M;4) under the homomorphism which “is” 
Ho, |Mx,. Meanwhile, consider the composite map 


Spin(d, Qe) > SO(d,Q) C O(d, Q). 


According to a striking theorem of Larsen [Lar-Max, 3.17], the inverse image of I’ 
in Spin(d, Q,) is, for a set of primes £ of Dirichlet density one, a “hyperspecial” 
maximal compact subgroup of Spin(d, Q,). Now for all £ invertible in k and not 
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in S, Ty lies in O(d, Ze), and so its inverse image lies in Spin(d, Z:). Whenever 
this inverse image is a maximal compact subgroup of Spin(d, Qe), it must, by its 
maximality, be equal to the possibly larger compact subgroup Spin(d, Z;). Thus we 
infer that among the primes £ invertible in k and not in S, there is a set of Dirichlet 
density one, the “good primes” over (k, @), for which the inverse image of Ty in 
Spin(d, Ze) is the entire group Spin(d, Ze). 

For each of these good £, which we may take to all be > 5, let us denote by 
Inoa ¢ the image in O(d, F,) of the arithmetic fundamental group 7; (Mx.¢) under 
the homomorphism which “is” Hz, ®z, Fe|M;,4. Then by Larsen’s theorem, Dod ¢ 
contains Q(d, F¢) for these good £. Thus we have 


Q(d, Fy) Cc T mod £ Cc O(d, F). 
Let us denote by Pscom, moa ¢ the image in O(d, F;) of the geometric fundamental 
group 7, (My.4). Then 
T'geom, mod € <J Tnod > 


and the quotient is cyclic, being a quotient of Gal(k/k). We claim that for each 
good £, we have 


Q(d, Fy) CG Tgeom, mod ¢- 
Indeed, the intersection Q(d, F¢) AT geom, mod ¢ inside Mnoa ¢ is anormal subgroup of 
Q(d, Fe) with cyclic quotient. As d > 3, Q(d, F¢) is its own commutator subgroup, 


so it has no proper normal subgroup which gives a cyclic quotient. Thus for each 
good £ we have 


Q(d, Fe) Cc Tscom, mod @ © Tod ¢ Cc O(d, Fe). 


Suppose we are given an integer r > 1, and a list of r good primes 
T<h, <hb<-:-<h. 


Denote by G the subgroup of the product group [],; O(d, F, ) consisting of those el- 
ements (A ;,..., A;) all of whose determinants, viewed in +1, coincide. Denote by 


Tscom, mod ¢1,€2,...€; CGC I] O(d, Fy, ) 


L 


the image of the geometric fundamental group 27°" (Mx.g) under the direct sum of 


the various mod ¢; representations. 
A key point is the following result of Goursat—Ribet type, cf. [Ribet-Gal, 5.2.2]. 


Lemma 7.1 The group Dgeom.mod €;,l5,..., contains []; Q(d, F¢,)- 


] [2@. Fe) Cc T’scom, mod Bi bayeccsy CGC I] O(d, Fy,). 


L t 
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Proof. The projection of Tgcommod ¢;,02,....0- C []; O(d.Fe,) to each O(d, F2,) 
factor contains Q(d,Fe,), as we have noted above. Now consider the commutator 

t, Of Teeom.mod 0.6... ¢,. As each group Q(d, F¢,) 
is its own commutator subgroup, D is a subgroup of []; 2(d, F¢,) which maps onto 
each factor. 

Suppose first that either d 4 4, or that d = 4 and all our groups are nonsplit. 
Then the individual groups Q(d,F¢,) are simple modulo their centers, and the 
corresponding simple groups are pairwise non-isomorphic. So by Goursat’s lemma 
[Ribet-Gal, 5.2.1], D maps onto each pair of factors Q(d, F¢;) x Q(d,Fe;),i < j. 
Since each Q(d, F¢,) has no nontrivial abelian quotients, Ribet’s lemma [Ribet-Gal, 
5.2.2] shows that D is the full product []; Q(d, F,,). 

In the remaining case, when d = 4 and all the groups are split, each 
Q(4, Fe,)/ + 1is PSL(2, Fe,) x PSL(2, Fe,). We first note that as PSL(2, Fe,) is 
simple and nonabelian, the only quotient groups of Q(4, Fp,)/+ =< PSL(2, Fy,) x 
PSL(2, F¢,) are the four obvious ones ({1} x {1}, {1} x PSL(2, Fe,), PSL(2, Fe,) x 
{1}, PSL(2, Fe,)x PSL(2, F¢,)). So the only quotient groups of (4, F», ) are either 
these groups or, possibly, double covers of them. There is no quotient of order 2, 
since (2(4, F¢,) is its own commutator subgroup. Thus the only quotient groups 
Hy # {1} of Q(4, Fe,) have the property that €; is the largest prime dividing 
the order of H; (since £; is the largest prime dividing the order of PSL(2, Fi, )). 
Therefore if £; A ¢;, then no quotient H; A {1} of (2(4, Fy,) is isomorphic to 
any quotient H; 4 {1} of (4, F;,), simply because these quotients have different 
orders. So by Goursat’s lemma [Ribet-Gal, 5.2.1], D maps onto each pair of factors 
Q2(4, Fe,) x Q(4, Fe,),i < j, and the proof then concludes as before, by invoking 
Ribet’s lemma [Ribet-Gal, 5.2.2]. oO 


We now make a choice of the integer r > 1, and of the list of r good primes 
1<bi <b <...<,. 


Recall the real € > 0 in the statement of the theorem we are to prove. 
There are three separate cases to consider. 
If d = 2n + 1 is odd, we choose r large enough that 


(1—1/4n)’ < €/2, 
and we take any list of r good primes 
T<l, <b, <-:- <8. 


If d = 2n is even, we first look to see whether or not there are infinitely many 
good primes £ where our orthogonal group O(d,F¢) is nonsplit. If there are, we 
choose r large enough that 


(1—1/8n)" <€/2, 
and we take any list of r good primes 


T<l, <b. <::- <4, 
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at which the corresponding orthogonal group is nonsplit. 
If d = 2n is even, and there are at most finitely many good primes where our 
orthogonal group O(d, Fe) is nonsplit, then we choose r large enough that 


(1 —1/32n)" < €/4, 
and we take any list of r good primes 
T<b, <b <:-:- <4, 


at which the corresponding orthogonal group is split. 

We now study what happens in the geometric generic fibre of M/R. Denote by 
K the fraction field of R, by K an algebraic closure of K, and by M,; the K-scheme 
obtained by the extension of scalars R C K. 

Denote by 


Ty,2com,mod Lala sccily CGC I] O(d,F¢,) 


the image of the geometric fundamental group m;"""(M;) under the direct sum 


of the various mod £; representations. By a fundamental specialization result of 
Pink [Ka-ESDE, 8.18.2, (1)], this group contains (an [],; O(d, F¢, )-conjugate of) 
the group Pyeom.mod ¢;,l9,...., We obtained by looking at the image of 27°""(Mx.4). 
As []; 2(d,Fe,) is a normal subgroup of []; O(d,F:,), being its commutator 
subgroup, we therefore have 


I] Q(d, Fy, ) C T5,geom,mod £1 ,£2,...€r CGC I] O(d,F;,). 


By this same result of Pink [Ka-ESDE, 8.18.2, (2)], there is a dense open set U 
in Spec(R) such that for any geometric point s in U, the group 


T's, geom,mod bad sbr CGC a O(d, Fy), 


obtained by looking at the image of 2;"°"(M5), is equal to (an []; O(d, Fy,)- 
conjugate Of) T5 seom,mod ¢1,2,....¢-- AS every subgroup of I], O(d@, F¢,) containing 
T], 2(d, F¢,) is normal, we therefore have equality: 


V5. geom,mod lus = T5.geom,mod £y ,€9,...,€, 


for every geometric point 5 in U. Each of the primes €),...,€, is nonzero in R 
(because each is invertible in k under @), so by shrinking U we may further assume 
that each of them is invertible on U. 

We will show that the theorem holds, for the fixed « > 0, if we take forr € R 
any nonzero element such that Spec(R[1/7r]) C U. Denote by 
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Varith,mod ¢),09,...0, C G C I] O(d, F;,) 
I 


the image of the arithmetic fundamental group 2,(M[1/r]). We now apply the 
Chebotarev density theorem in the uniform version given in [Ka-Sar, 9.7.13] to this 
situation, our M [1/r]/R[1/r] taken as the X/S there, and with our groups 


T5,geom,mod £3 a scccly CG Darith mod £1 02, 6isbr 


taken as the groups K C Karim there. In our situation, the quotient Kajt,/K is 
abelian, so the sets Karin, there are just the cosets of K in K,,ith. For a given pair 
(k, p) consisting of a finite field & and a ring homomorphism ¢ : R[1/r] > k, 
all the Frobenius conjugacy classes attached to the k-points of M;.g lie in a single 
coset of K in Karith, Say Karith,y. Inside this coset Karith,y, take the subset W which 
is defined as follows. 

If d = 2n + 1 is odd, or if d = 2n and all the r orthogonal groups O(d, F¢,) are 
nonsplit, W consists of all elements (A,..., A;) in the coset Karith,,such that for 
some i, Rdet(1 — 7A;) is Fe, -irreducible. 

If d = 2n > 6and all the r orthogonal groups O(d, F¢,) are split, then W is the 
disjoint union of two sets, W_ and W,, defined as follows. The set W_ consists of 
those elements (A;,..., A,) in the coset Karith,, such that the common value of their 
determinant is —1 and such that for some 7, Rdet(1 — TA;) is Fe, -irreducible. The 
set Wi consists of those elements (A;,...,A;) in the coset Karith,, such that the 
common value of their determinant is +1 and such that for some i, Rdet(1 — TA;) 
is of the form 


(Fe, — irreducible of degree 2)(F, — irreducible of degree 2n — 2), 
AND such that for some 7, Rdet(1 — TA; ) is of the form 
(F¢, — irreducible of degree n)(a different F, — irreducible of degree n). 


If d = 4 and all the r orthogonal groups O(4, F¢,) are split, then W is again 
the disjoint union of two sets, W_ and W,. The set W_ is defined exactly as in the 
paragraph above. The set W+ consists of those elements (Aj,..., A;) in the coset 
Karith, Such that the common value of their determinant is +1 and such that for 
some i, Rdet(1 — 7A; ) is of the form 


P(T)Q(T) 


with P(T) and Q(T) relatively prime F,, -irreducibles of degree 2, neither of which 
is palindromic, and such that 


Q(T) = (some constant in Ff)T? P(1/T). 
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Decompose the K-coset Karith,y into cosets under the smaller group [], Q(d, F¢,), 
say 


Kaithy = | [ Coseta. 
In view of Theorems 6.7 through 6.10, we see that in each such coset, we have 
#(W M Coset,)/#Cosety > 1—«€/2. 
Summing over the cosets, we find that 
#W/#Karitny = 1—€/2. 


By the Chebotarev density theorem in the uniform version given in [Ka-Sar, 9.7.13], 
there exist constants C and A such that if #k > 4A2, then 


|#W/#K arith. — #{m € Mxg(k)|Frobim € W}/#Mx.o(k)| 


= 2C #Kaitn/Sqrt(#k) 


For #k sufficiently large, we obviously have 
2C#K aritn/Sqrt(#k) < €/2, 
and hence for #k sufficiently large we have 
#{m € My.¢(k)|Frobym € W}/#Mz g(k) = l—e. 


It remains only to show that whenever Frob, », lies in W, then Rdet(1—TFrob, ,,) 
is Q-irreducible. To see this, we argue as follows. This polynomial has coefficients 
in Z[1/#k]. If either d is odd, or d is even and each O(d, F;,) is nonsplit, or d is 
even and the sign in the functional equation is —1, then for some the reduction mod 
£; of this polynomial Rdet(1 — TFrob,.»,) is F¢, -irreducible, this being the defining 
property of W, and hence Rdet(1 — TFrob, .»,) is Q-irreducible. 

It remains to treat the case in which d is even, each O(d,F¢,) is split and the 
sign in the functional equation is +1. Suppose first that d = 2n > 6. Then for 
some i the reduction mod €; of Rdet(1 — TFrob,.»,) is the product of two F,,- 
irreducibles, of degrees 2 and d — 2, while for some j the reduction mod ¢ j of 
Rdet(1 — 7 Frob;.,,,) is the product of two F¢ ,-ureducibles, both of degree n, this 
being the defining property of W in this case. So once again Rdet(1 — TFrob, ,) 
must be Q-irreducible. [For if it were Q-reducible, its Q-factorization would 
simultaneously be of the form (degree 2 irred.)(degree d — 2 irred.) and of the 
form (degree n irred.) (degree n irred.).] Suppose now that d = 4. Then for some /, 
Rdet(1 — TFrob,,,,) is the product 


P(T)Q(T) 
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with P(T) and Q(T) relatively prime F,, -irreducibles of degree 2, neither of which 
is palindromic, and such that 


Q(T) = (some constant in Fx)T*P(1/T). 


This implies that Rdet(1—T Frob,.,,) is Q-irreducible. For if it were Q-reducible, its 
Q-factorization would be as the product of two relatively prime Q-irreducibles 
of degree 2, neither of which is palindromic. But Rdet(1 — T’Frob;,,) has all its 
eigenvalues on the unit circle (because pure of weight zero), hence both its Q- 
irreducible quadratic factors have roots stable by inversion € +» 1/¢. Since these 
Q-irreducible factors have degree 2, none of their roots is fixed by inversion (i.e., 
no root is +1), and hence each Q-irreducible factor has roots of the form (¢, 1/¢), 
hence is palindromic. 


8 Another application of Theorem 5.1: Universal families of 
hypersurface sections 


Let R be a finitely generated Z-algebra, P = P' /R the projective space of some 
dimension N, and X C P a closed subscheme which is smooth over R with 
geometrically connected fibres, all of some common odd dimension v = 2n+1 > 3. 
Fix an integer d > 1. Denote by M/R the parameter space for smooth, degree d 
hypersurfaces in the ambient P which are transversal to X, by Ha/M C P/M the 
universal family of these hypersurfaces, and by z : X NHg — M the corresponding 
universal family of smooth, degree d hypersurface sections of X . Concretely, if k is 
afield and ¢ : R > k is aring homomorphism, then M;.g is the parameter space for 
smooth, degree d hypersurfaces which are transversal to X;4. For each prime ¢, we 
have the lisse (but not necessarily torsion-free) Zy-sheaf R*"2,Z;:(n) on M[1/€], 
together with its cup product pairing 


R7"1,Z¢(n) x R7"2,Z,(n) > R“"2,Z¢(2n) & Ze, 


which is an orthogonal autoduality modulo torsion. Let us denote by p : X — 
Spec(R) the structural morphism of X/R. On Spec(R[1/£]), we have the lisse Zy- 
sheaf R?" p,Z,(n), and we denote by R*" p,Zc(n) mji/q its pullback to M[1/€]. The 
canonical restriction map on cohomology gives an inclusion 


R*" 0,Ze(n) upg CR" m1.Ze(n). 


We denote by 
Evz, C R*"1,Z¢(n) 
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the orthogonal to R?” p,Z:(n) m{i/q under the cup product pairing. The lisse sheaves 
Evz, on M[1/€],carry the induced cup product pairing 


Evz, x Evz, > Ze. 


If we tensor this situation with Q¢, the Hard Lefschetz Theorem [De-Weil II, 4.1.2] 
tells us that this pairing on Evg, := Evz, ®z, Qe is an orthogonal autoduality. By 
the Riemann Hypothesis for projective smooth varieties over finite fields [De-Weil I, 
1.6], we know that the sheaves F'vg, are pure of weight zero, and form a compatible 
system. By Gabber’s theorem [Gab-Tors], for all but finitely many primes ¢, the 
sheaves R*" po, Ze (n), R2”" p,Z¢ (n)mii/e), and E'vz, are all torsion free, and the cup 
product pairing makes E'vz, orthogonally self dual over Ze. A fundamental result of 
Deligne [De-Weil I, 4.4.1,4.4.2s,and 4.4.9], amplified by [Ka-LAMM, 2.2.4] and 
[Ka-Pan, Corollaries 2 and 3], tells us that the condition (2 strong) holds for the 
compatible system given by the E'vg,, provided that d > 3 if 2n > 4, or that d > 4 
if 2n = 2, and that, under these conditions, the common rank of the sheaves E'vg, 
is > 9. So Theorem 5.1 applies to this situation. 

Let us spell out the simplest case of this situation. We take R to be Z, and we 
take X¥ = P = P?"*'/Z, 2n > 2, with the identical embedding of P into itself. We 
fix an integer d with d > 3 if 2n > 4, ord > 4if 2n = 2. Fora finite field k = Fy, 
and a smooth hypersurface Hy/k of degree d and dimension 2n over k, we know 
that its Zeta function is of the form 


2n 
Zeta(Ha/k,T) =1/P(Ha/k,T)|[Q-q'T). 
i=0 


Here P(Ha/k,T) € Z[T] is the polynomial whose unitarization 
P,(Ha/k, Fr) = P(Ha/k, T/q") 


is given by 
P,(Ha/k,T) = det((1 — TFrobg,x,|Evo,), 


for any prime ¢ invertible in k. The reduced unitarization P, eq is defined by 
Purea(Ha/k, T) := Rdet((1 — TFrob,;,7,|Evo,). 


For each finite field k, we denote by IrrFrac(k, d, 2) the fraction of the smooth 
hypersurfaces Hz /k of degree d and dimension 27 over k for which the polynomial 
Purea( Ha /k, T) is Q-irreducible. Then Theorem 5.1 gives us the following result. 


Theorem 8.1 Jn any sequence i ++ k; of finite fields whose cardinalities are strictly 
increasing, the sequence of fractions i +> IrrFrac(k;,d,2n) tend to 1. 
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9 Alternative approaches 


The Chavdarov approach to studying irreducibility requires knowledge of mod ¢ 
monodromy for infinitely many primes £. We have used Larsen’s theorem [Lar-Max, 
3.17] to infer, from information about the £-adic monodromy for all (invertible 
on the base) £, information about mod € monodromy for a set of primes ¢ of 
Dirichlet density one. There are two other approaches, which, when they apply, 
give information about mod £ monodromy for all but finitely many primes @. 

The first is based on the theorem of Mathews, Vaserstein, and Weisfeller 
[MVWF, which concerns a smooth groupscheme G/Z[1/N] whose complex fibre 
Gc is a connected, semisimple, simply connected group, and a finitely generated 
subgroup [ Cc G(Z[1/N]) which is Zariski dense in Gc. For any ¢ which is prime 
to N, we have a “reduction mod £” homomorphism 


lc G(Z[l/N]) > G(F2). 


The theorem asserts that [ maps onto G(F¢) for all sufficiently large 2. 

Let us explain an instance of when we can apply this method, and what kind of 
result it gives. Let us put ourselves in the general setup E/U/M/R of Section 3, 
and assume that conditions (1) and (2weak) of that section hold, and that d > 3. 
Pick an embedding of R into C, and make the corresponding extension of scalars. 
As explained in the proof of Theorem 3.1, we have, for some integer N > 1, an 
orthogonally self-dual lisse sheaf Hy, jny OD M*. Enlarging N, we will further 
suppose that N is even. Let us denote by 


pi: 11 (M™) > O(d,Z[1/N)]) 


the corresponding “transcendental” monodromy representation attached to Hah /N}" 
For every prime £ not dividing N, we also have the algebro-geometric ¢-adic 
monodromy of Hz,|Mc, 


pce : (Mc) > O(d, Zz). 


By the comparison theorem, the algebro-geometric fundamental group (Mc) is 
the profinite completion of 7 (M*"). For every ¢ not dividing N, the €-adic image 
pce(1(Mc)) C O(d,Z,) is the closure (in O(d, Zz) with its profinite topology) 
of the topological image p@'(z1(M™)) C O(d, Z[1/N]). By Pink’s specialization 
theorem [Ka-ESDE, 8.18.2, (2)] applied to Hz, on M[1/€], we may infer from 
condition (2weak) that the £-adic image pc.¢(7,(Mc)) is Zariski dense in either 
O(d,Q)) or in SO(d, Q,). By the ¢-adic continuity of polynomial functions, it then 


Unlike the Larsen result or the Zalesskii-Serezkin result to be discussed below, this result [MVW] 
depends upon the classification of finite simple groups. 
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follows that the topological image p#(7;(M*")) C O(d, Z[1/N]) is Zariski dense 
in the same group, either O(d, Q,) or SO(d, Q;). Picking an embedding of fields 
QC C, we see that the topological image is Zariski dense in either O(d, C) or in 
SO(d,C). Since the topological fundamental group 7,(M*") is finitely generated, 
its image 

Py = pe'(m(M™)) C Od, Z[1/N]) 


is a finitely generated subgroup of O(d, Z[1/N]) which is Zariski dense in either 
O(d) or SO(d). We cannot yet apply [MV W], because the orthogonal group O(d) 
is not connected and its identity component SO(d) is not simply connected. We get 
around this difficulty following an argument of Ron Livne. First, replace I’; by the 
subgroup [2 C I, of index 1 or 2 consisting of the elements of determinant +1. 
Then [> is a finitely generated, Zariski dense subgroup of SO(d, Z[1/N]). Next 
consider the Spin group attached to our orthogonal group. The spinor norm gives an 
exact sequence 


{1} > £1 > Spin(d, Z[1/N]) > SO(d,Z[1/N]) > Z[1/N]*/(Z[1/N]*)2, 


in which the last term, Z[1/N]*/(Z[1/N]*)?, is finite, generated by —1 and by the 
primes dividing N. Now consider the composite homomorphism 


Ty Cc SO(d,Z[1/N]) > Z[1/N]*/(Z[1/N]*)’. 
Its image is finite. So the subgroup 
3 := Ker(T'y > Z[1/N]*/(Z[1/N]*)?) C Ty 


is a subgroup of finite index in Ts, so is still Zariski dense in SO, and still finitely 
generated. Every element of I’; lifts, in two different ways, to Spin(d, Z[1/N)]). 
Denote by 

Tc Spin(d, Z[1/N]) 


the complete inverse image of I3. This group I is finitely generated (because 3 
is), and, as it maps onto [T3, it is Zariski dense in Spin. We may now apply the 
theorem of Mathews, Vaserstein, and Weisfeller [MVW], tol’ C Spin(d, Z[1/N]), 
to conclude that for all sufficiently large @ prime to N, say for all ¢ not in the 
finite set S, P maps onto Spin(d, F;). For any such £, 3 maps onto the image of 
Spin(d, F,) in SO(d, F,), i.e., '3 maps onto Q(d, Fr). So for any such £, the image 
of TP; in O(d, F¢) contains Q(d,F:). Because the algebro-geometric fundamental 
group 21(Mc) is the profinite completion of 2;(M*"), this last image is also the 
image of 21(Mc) in O(d, Fe). Thus we find that the image of 2,;(Mc) in O(d, F,) 
contains Q(d,F,) for every @ notin S. 

So far, all of this is taking place on the complex fibre of M/R. Let us say that 
M/R is nicely compactifiable if there exists a proper smooth R-scheme M’/R 
and a divisor D C M% which has normal crossings relative to R, such that M = 
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M*\ D. By resolution over the characteristic zero fraction field of R, we know that 
there exists a nonzero r € R such that M[1/r]/R[1/r] is nicely compactifiable. 
[This passage, from R to some R[1/r], is not entirely harmless. For instance, in 
the second example, of Weierstrass families, where we start, for a given (do, d3), 
with R = Z[1/6] and the corresponding M = Mz, a,/Z[1/6], we do not know 
which, if any, other primes p we need to invert to get a nice compactification, nor 
do we know how this set of p depends on (d>, d3). In our 2004-2005 course, we 
followed the [MVW] method when M/R was nicely compactifiable, as explained 
in the next paragraph, but then invoked the Larsen method to handle separately each 
of the finitely many unknown bad p.] 


When M/R is nicely compactifiable, with R a normal integral domain whose 
fraction field has characteristic zero, Abhyankar’s lemma [SGA 1, XIII, 5.5] assures 
us that for any lisse sheaf on M[1/£], and any geometric point s of Spec(R[1/£]), its 
restriction to the geometric fibre M, of M[1/£]/R[1/€] is tamely ramified at each 
maximal point of D,. We apply this to the lisse sheaf z, ® Fy on M[1/€], for each 
£ notin S. The Tame Specialization Theorem [Ka-ESDE, 8.17.14] then tells us that 
for every £ not in S, and for every geometric point s of Spec(R[1/€]), the image of 
z1(M,) in O(d, Fe) contains Q(d, Fe). 

We now turn to a second approach’ to controlling the mod £ monodromy for all 
but finitely many £. This approach is based on the Zalesskii-Serezkin classification 
[Zal-Ser, Theorem, page 478] of irreducible subgroups of GL(n, F2), £>3,n > 3, 
which are generated by reflections and which contain no transvections (:=unipotent 
pseudoreflections). We can apply this to describe all irreducible subgroups of 
orthogonal groups in odd characteristic generated by reflections because such 
orthogonal groups contain no transvections. Here is a baby version of their result 
in this case. 


Theorem 9.1 (Zalesskii-Serezkin) Given an integer n > 3, there exists a constant 
C(n) with the following property. Let € > 3, and (V,W) an n-dimensional 
Fe-vector space with a symmetric autoduality UV. Let G C O(V,W) be an 
irreducible subgroup generated by reflections. Denote by No(G) the normalizer 
of G in O(V, WV). Then either Q(V,Y) C G, or we have the divisibility estimate 
#No(G)|C(n). Moreover, ifn > 9, we can take C(n) = 2"(n + 2)!. 


Proof. We begin by recalling that if G C O(V,W) is an irreducible subgroup 
generated by reflections, then G is absolutely irreducible (i.e., G acts irreducibly 
after extending scalars from F, to Fy). Now for any absolutely irreducible subgroup 
G Cc O(V, WY), we have the divisibility estimate 


#No(G)|2#Aut(G), 


simply because the kernel of the conjugation action homomorphism 


3A third approach would be to appeal to the results of Hall [Ha]. 
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No(G) — Aut(G) 


lies in the subgroup of scalars in O(V, VY), which is +1. 

It is immediate from [Zal-Ser, Theorem, page 478] that for n > 9, there are 
at most two primitive such groups G which fail to contain Q(V, V), namely the 
symmetric group S,+1, if £ is prime to n + 1, and the symmetric group Sy+2, 
if € divides n + 2. For these G, every automorphism is inner. For 3 < n < 
8, there are finitely many more such primitive G, and these we handle by the 
#No(G)|2#Aut(G) divisibility. 

We now consider the imprimitive such G. For n > 5, any imprimitive such 
group has a unique [Zal-Ser, 4.1] system of imprimitivity consisting of the lines L; 
spanned by-linearly independent vectors e;, and the induced homomorphism maps 
G onto the symmetric group S,,. For each of n = 3 andn = 4, there is at most one 
imprimitive G for which the system of imprimitivity is not unique [Zal-Ser, 4.1], 
and these cases are handled by the #No(G)|2#Aut(G) divisibility. 

It remains to treat the case of an imprimitive such G which admits a unique 
system of imprimitivity. By uniqueness, the system of imprimitivity is respected by 
No(G), so we have a homorphism of Ng(G) onto S,,. It remains only to show the 
following claim: in the basis given by the vectors e;, any element g € No(G) which 
lies in the kernel of this homomorphism, i.e., which is diagonal, has entries each +1. 
Indeed, we will show that any element g € O(V, Y) which is diagonal in this basis 
has entries +1. Let us denote by 4; the diagonal entries of g. 

From the fact that G induces every possible permutation of the lines L;, we see 
that 


(1) Either all square lengths Y(e;, e;) are nonzero, or they are all zero. 
(2) Either all cross terms Y(e;,e;) are nonzero, for alli # j, or they are all zero. 


If all W(e;, e;) are nonzero, our claim is obvious, since 
AWE, e:) = V(g(ei), (ei) = Ver, 1). 


If all W(e;, e;) vanish, then by nondegeneracy all W(e;, e;) are nonzero, for alli # 
j. From the identity 


AjiAj V(ei,e;) = V(g (ei), g(e;)) = V(Ei,e;), 


we then infer that for every i # j, we have A;A; = 1, which in turn forces all A; to 
be equal to each other, with common value +1. Oo 


Armed with this result, we can prove an “almost all €” result about the mod 
£ monodromy of Lefschetz pencil of even fibre dimension 2n > 2. Let us put 
ourselves in the situation of Section 8, but taking now the base ring R to be a finite 
field k. We take the degree d of the hypersurface sections large enough that the 
common rank N of the sheaves Evg,, for every ¢ invertible in k, is > 3. We suppose 
that the condition (2 strong) holds, and that there exist, over k, Lefschetz pencils on 
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X of hypersurface sections of degree d for which (2 strong) holds as well. [As noted 
above, the first condition is automatic if d > 3 and d + 2n > 6, in which case we 
have N > 9. Moreover, in this case Lefschetz pencils exist, and sufficiently general 
ones will satisify (2 strong).] 


Theorem 9.2 For all sufficiently large primes £, the image of the geometric 
fundamental group 1\(M;z) in O(N, Fe) under the monodromy representation of 
Evz, ® Fe contains Q(N,F;). More precisely, it is the following subgroup of 
O(N, F¢): if (—1)"2 is a square in Fe, it is the subgroup of elements of spinor 
norm one. If not, it is the subgroup of elements having sp = det. Moreover, for 
any (sufficiently general, if char(k) = 2) Lefschetz pencil satisfying (2 strong), we 
have the same results for the image of its geometric monodromy, with a possibly 
larger set of “bad” €. 


Proof. By Gabber’s theorem [Gab-Tors], applied both to X and to any single 
smooth hypersurface section X 1 Hg of degree d, we know that for all but finitely 
many £, both spaces have their Zp-cohomology torsion-free, and the hard Lefschetz 
theorem holds mod ¢ on both. These are the “good ¢” for the theorem. Because the 
fibre dimension 27 is even, we know, by [SGA 7 II, XV 3.4, XVIII 6.2 and 6.3], 
that “condition A” of [SGA 7 I, XVIII 5.3.5] holds for any Lefschetz pencil on 
X. An attentive reading of the entire exposé [SGA 7 II, XVIII] then shows that for 
all these good £, the mod £ geometric monodromy of any (sufficiently general, if 
char(k) = 2) Lefschetz pencil is an irreducible subgroup of O(N, F,) (this uses 
the conjugacy of the vanishing cycles [De-Weil II, 4.2.7]) which is generated by 
reflections in various vectors 6; with square length 6; - 6; = (—1)"2 (this is the 
Picard-Lefschetz formula [SGA 7 II, XV 3.4]). 

Let us begin with a Lefschetz pencil, defined over k and hence over some finite 
extension E/k, for which (2 strong) holds. Since the statements to be proven are 
geometric, we may extend scalars, and reduce to the case when our Lefschetz 
pencil satisfying (2 strong) is defined over k. From the theorem of Zalesskii— 
Serezkin above, we see for a given good £, there are only two possibilities: either 
the image T'seom, moa ¢ Of the geometric monodromy group of our Lefschetz pencil 
is the asserted group, or its normalizer No(Tgeom, mod ¢), Which contains the mod 
£ image Puarith, moa ¢ Of the arithmetic monodromy group, is a group whose order 
divides C(N). We will show that this can happen for only finitely many good ¢. 
Indeed, we will show that the inequality #V arith, moa ¢ < CCN) can hold for only 
finitely many good €. For this, we argue as follows. 

Because our pencil satisfies (2 strong), we know by Deligne’s equidistribution 
theorem, cf. [Ka-GKM, 3.6], that as we run over larger and larger finite extensions 
E/k, and consider all the smooth, degree d hypersurface sections X N Hq defined 
over E, the (unique conjugacy classes having) reversed characteristic polynomials 


det(1 — TFroby.ynu,|Ev") € Z[1/#K][T] 


become equidistributed, for (the direct image of) Haar measure, in the space 
O(N,R)* of conjugacy classes in the compact orthogonal group O(N,R). The 
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space O(N,R)* is a compact metric space [namely, the set of all degree N 
polynomials in 1 + TR[T] all of whose roots lie on the unit circle], every nonempty 
open set has strictly positive measure, and it is infinite. So if we take 1 + C(N) 
distinct points A; in O(N,IR)*, and tiny open balls B; around A; which are 
pairwise disjoint, then for E sufficiently large, we can find 1 + C(N) different 
smooth, degree d hypersurface sections X¥ M Hg; defined over EF such that the 
reversed characteristic polynomial of Froby,yqy,; lands in B;. So these reversed 
characteristic polynomials are pairwise distinct. Let us enumerate these polynomi- 
als, say Po(T), Pi(T),..., Pec) (T). Now consider the product polynomial 


RT) := [i - P7)). 


0<i <j <C(N) 


This is a nonzero polynomial in Z[1/#k][T], hence it is nonzero mod all sufficiently 
large primes £. For any good prime £ mod which it is nonzero, the 1 + C(N) 
Frobenius conjugacy classes Froby,yn4,; must have distinct images in Tyrith, moae, 
since they have distinct mod ¢ characteristic polynomials. So certainly we have 
#T arith, mode = 1 + C(NV) for these good €. 

To treat the situation over M itself, we note that our single Lefschetz pencil 
above shows us for all sufficiently large good primes £, the image of the geometric 
fundamental group 2,(M;) in O(N, Fe) under the monodromy representation of 
Evz, ® F, contains the asserted subgroup of O(N, F;¢). To see that this image can 
be no bigger, use the fact that for any given good prime £, Bertini’s theorem says that 
already a sufficiently general Lefschetz pencil will have the same mod £ geometric 
monodromy as does M itself. Since other choices of Lefschetz pencils may require 
omitting fewer good £ than did our initial choice, the result over M may have fewer 
bad £ that the result for some particular choice of Lefschetz pencil. oO 
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Diophantine methods for hyperbolic curves 
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Dedicated to the memory of Serge Lang 


Abstract In a letter from Grothendieck to Faltings, it was suggested that a positive 
answer to the section conjecture should imply finiteness of points on hyperbolic 
curves over number fields. In this paper, we point out instead the analogy between 
the section conjecture and the finiteness conjecture for the Tate-Shafarevich group 
of elliptic curves. That is, the section conjecture should provide a terminating 
algorithm for finding all rational points on a hyperbolic curve equipped with a 
rational point. 
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In earlier articles [8—10] attention was drawn to the parallel between the ideas 
surrounding the well-known conjecture of Birch and Swinnerton-Dyer (BSD) for 
elliptic curves, and the mysterious section conjecture of Grothendieck [6] that 
concerns hyperbolic curves. We wish to explain here some preliminary ideas for 
“effective non-abelian descent” on hyperbolic curves equipped with at least one 
rational point. We again follow in an obvious manner the method of descent on 
elliptic curves and therefore rely on conjectures. In fact, the main point is to 
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substitute the section conjecture for the finiteness of the Tate-Shafarevich group. 
That is to say, the input of the section conjecture is of the form 


section conjecture => termination of descent. 


At a number of different lectures delivered by the author on the topic of fundamental 
groups and Diophantine geometry, the question was raised about the role of surjec- 
tivity in the section conjecture as far as Diophantine applications are concerned. The 
demonstration of this implication is intended as something of a reply. 

To start the descent, on the other hand, requires the use of p-adic Hodge theory 
and the unipotent Albanese map. In this process, in general, another conjecture 
is unfortunately needed. It could be, for example, the Bloch—Kato conjecture on 
surjectivity of the p-adic Chern class map that has been referred to in [9]. In other 
words, via the construction of Selmer varieties and Albanese maps, one deduces an 
implication 


Bloch—Kato conjecture => beginning of descent. 


The main caveat here arises from the lack of actual knowledge of computational 
issues on the part of the author. To avoid misleading anyone about what is being 
achieved here, we have in the following section separated out the questionable 
portions as hypotheses [H] and [H’]. That is to say, the objects that mediate this 
process, namely Galois cohomology groups/varieties and maps between them, seem 
in principle to be computable. But even to the algorithmically illiterate perspective, 
it is obvious that actual computation would be daunting to the point of impossibility 
given the technology of the present day. Nevertheless, it is perhaps not entirely 
devoid of value to point out one direction of investigation in effective methods, in 
the hope that even incompetent strategies may eventually be improved through the 
focusing of sharper skills obviously available in the community. Hence the present 
paper. 

One point of some theoretical interest concerns the comparison with “effective 
Mordell conjectures” in the usual sense where upper bounds for heights are 
proposed. If we fix a point b on the curve and measure heights with respect to 
the corresponding divisor, the height of another point measures the inverse distance 
from b at all places. So an upper bound for the height corresponds to a lower bound 
for the distance from b at all places. On the other hand, what the p-adic Hodge 
theory provides (in principle) is a lower bound for the p-adic distance between all 
pairs of points at one place. This lower bound is exactly what is required to start the 
descent. 

Finally, we make the obvious point that the use of conjectures is probably not 
a serious obstacle from the computational perspective (that is, in comparison to 
the problem of feasibility). This is in the same spirit as the standard algorithms 
for computing Mordell—Weil groups of elliptic curves where the BSD conjecture is 
employed with just a few misgivings [3]. 
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1 Brief review 


Here we will be intentionally brief, referring the reader to [4] and [9] for a more 
thorough discussion. 

Let X /Q be a proper smooth hyperbolic curve of genus g with a point b € X(Q) 
and let S be the set of primes of bad reduction for X. In the following, we shall 
be a bit sloppy and mostly omit separate notation for an integral model of X. 
Choose a prime p ¢ S and let U + = me (X,b) be the Q,-unipotent étale 
fundamental group of X := X Xspeccg) Spec(Q) and U* = (U*)"*!\U* its 
quotient by the (7 + 1)th level of the descending central series normalized so that 
(U*)! = U*. Let I be the Galois group of Q over Q. We have defined the Selmer 
varieties 


Hy, Uf) 
[9, 10] classifying [’-equivariant torsors for ue ci are unramified at all places not 
in {p}US and crystalline at p. (This is A; (Cr, US,) in the notation of [8] and [9], 
where T = S U {p}.) These are affine algebraic varieties over Q, whose algebraic 
structures are defined inductively starting from 
Hy (0, Uy) & HYD, Hi(X, Q,)), 
which is a Q, vector space. Recall the fundamental diagram ([9], end of section 2) 


X(Q ——— X(Q)) 


ét,glob ét,loc 
Kn Kn 


loc ety 


‘ >. 

Hy(T,U,') + Hy (Gp, U,* USP? 

Here, Hy (Gp, U, St) classifies Gp := Gal(Q p/Qp)-equivariant torsors for ue that 
are crystalline, while U,"/F° classifies compatible pairs 7," ~ 7; of torsors for 
the de Rham and crystalline fundamental groups U;“" and U‘' equipped with Hodge 
filtrations and Frobenius endomorphisms compatible with the torsor structures. The 
maps associate to each point x € X(Q) the class of the torsor of paths from b to x 
in the appropriate category. So 


iS (x) = [f” FB, x) 
with T-action, 


xcbloc (x) = [nyo (X; b, X)n] 


n 
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with G,-action, and 
aa) — [a(x ® Q,: b,x) = no(Y b, 21, 


where Y is the reduction mod p of a smooth Z[1/S] model for X. 
In contrast to this mass of notation, the section conjecture considers just one map 


&: XQ—H'(T, m1 (X,b)) 
that sends a point x € X(Q) to the class of the profinite torsor of paths 
71(X;b, x) 


with I-action. It proposes that this map should be a bijection. The injectivity is 
already known as a consequence of the Mordell—Weil theorem for the Jacobian J of 
X, while the surjectivity seems to be a very deep problem. The question mentioned 
in the introduction arises exactly because the injectivity appears, at first glance, to 
be more relevant for finiteness than the surjectivity. The idea for using the bijectivity 
seems to have been to create a tension between the compact profinite topology of 
H'(V, %(X,b)) and the “discrete nature” of X(Q). At present it is unclear how this 
intuition is to be realized. But as mentioned, when the finiteness is obtained through 
a different approach, we wish to explain the use of the surjectivity for finding the 
full set of points. 
Using the exact sequence 


02 U"'\U" 3 U41 Un 0 


for each of the fundamental groups, the global Selmer variety is fibered according 
to the sequence 


0 A(T, US '\(U)) > APL, Un HP Up). 
which means that the kernel acts on the variety in the middle with orbit space a 


subset of the third object. If we denote by r,, the dimension of U,,, there is a recursive 
formula [11] 


Zimiri = (g + Vg? —1)" + (g— Vg? 1)" 


which implies in particular that 


tm = (g + Vg2—1)"/n+ O(g"”) 


for some O(g”/?) that can be explicitly computed. 
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The global Selmer variety has its dimension controlled by the Euler characteristic 
formula for the cohomology of the group '7 = Gal(Qr/Q), where T = S U {p} 
and Qr is the maximal extension of Q unramified outside T ([9], Section 3). 
It reads 


dimH (Tr, oy io") = dimH?([r, (Corr ery) 
_ diml(U yar y 7, 
where the minus in the superscript refers to the sign for the action of complex 
conjugation. The dimension of this minus part can be estimated as follows. The 
action of complex conjugation on the étale fundamental group U“ is compatible 


with its action on the Betti realization U® of the motivic fundamental group [4], 
according to which 


(UBy"th\ (yByn 

has a pure Hodge structure of weight n. So when 7 is odd, we get 
dim[(U yy" OF y"|- 7,2: 

But when n = 2m is even, there is the contribution from the (m,m) component 

to consider, which can be complicated. This (m,m) component is a quotient of the 


(m,m)-part of 
Hy (X(C), Bh ; 


which has dimension cy) g”"". So for simplicity, we will just use the tautological 
estimate 

dim[(US)"'\(US)" Sm 
for n even. 


In [9], Section 3, we analyzed the use of the corresponding Tate—Shafarevich 
groups 


IP (US) '\(US)")) = KerlH? (Pr, US)"*'\US)") 
> Ger WG, (US \U)")), 
which is dual to 
TH! ((US)"*'\(US)")* (1) = KerlH (Pr, (US)"*'\(U%)")*(1)) 


> Ber ANG, (UT '\U)")* YI. 
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There is a Chern class map [1] 
chy: : KL? _,(X") @ Q, > A(T, H"(X", Qp(1))) 

forn # 1 whose image lies in a “geometric” subspace 

Hy (Gal, H"(X", Q,(1))) 
that contains 

T'(H"(X", Q,(1))). 
In fact, 
mm (usy"*\(o%)")* ()) 

is a subspace of IT! (H"(X", Q,(1))) because the representation Cyr" (eye 


isa direct summand of Hy" (x ,Q A a which, in turn, is a direct summand of 
(H"(X", Q,))*. But Bloch and Kato conjecture that 


chy: K,_,(X") ® Q, +H! Gal(Q/Q), H"(X", Q,(1))) 
is an isomorphism. Thus, when n > 2, we get 
mn (us) "\(U)")1"()) = 0. 


We recall the explicit bound for the local H ? ({9], Section 3). For v Hx p, we 
have 


dimH2(G,, (U%)"*!\(U%)") < ng" +2 (2¢)?g""?, 


(n— 1) 
2 
while 
dimH?(Gp, (U%)"*!\(U")") < ng". 
Finally, as regards the contribution of the Hodge filtration, we saw in [9] that 
sr ay ae 


so that 


dim(U*)"t'"\(US "4" /F° > rm, — g" = (g + Vg? — 1)"/n— 8" + O(8"”). 
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2 Beginning the descent 


Since it costs very little extra work to define, we will in fact consider the refined 
Selmer variety 


Hyg. Uy’) C HPL, Uy) 
consisting of classes whose images in 
H;(T, US") 


go to zero under all localization maps 


4. LOC, é 
H;(, US) >" H'(G,, U7) 


for v # p. As explained in [10], the image of X(Q) under xq"? lies in 
Hy (T : Us, From the estimates of the previous section, it is obvious that 


assuming the Bloch—Kato conjecture, there is an effectively computable t such 
that 


dimH f(T, U") < dimU,"/F° 


forn >t. 


Of course the computation starts out with an estimate for dimH! (1, U ie which 
according to the usual BSD is the same as the Mordell—Weil rank of J. After that, the 
dimension of dimH fol US) grows as a function of n with an explicit upper bound, 


while the dimension of ue / F° grows with an explicit (and eventually bigger) lower 
bound. Written out, the estimate for growth looks like 


dimH fo(T, Usy.41) S dimH fo (V. Uy) 


+12n/2 + |S|[2n)g™" + Cnn Dog yg?) + (2n)g?" 
and 
dimH fo (T, USi4y) < dimH fo (VU, Using 1) + rant 
+1S[f2n + gt 4 SAF OEM oye + (ont yg, 
while 


dimU, +1 > dimU;, + rn — g". 
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We eventually get an inequality in the right direction because of the asymptotic 
behavior of r,,. In this regard, note that g + /g2—1> g for g > 2. 
As a consequence of the discrepancy in dimension, the image of 


D olocy : Hyo(T, U;*)>U"/F° 


is not Zariski dense. In contrast to difficult sets like X(Q), the classifying spaces 
for torsors and the maps between them are algebro-geometric objects which can be 
computed in principle. This should work in the manner of computations with the 
usual method of Chabauty as appears, for example, in [7] (cf. the discussion of 0 
in the introduction). In case this is not convincing, we will adopt it as an additional 
hypothesis: 


[H]: The map 
D olocy : Hjo(T, Us") >U""/F° 
can be computed. 

The end result of this is that assuming B-K and [H], we can find an algebraic 
function a on U;"/ F° that vanishes on the image of H fol U;). Now, when we 
restrict a to X(Q,) it becomes a linear combination of p-adic iterated integrals. 
To elaborate on this point a little more, recall ([9], Section 1) the description of 
the coordinate ring of the de Rham fundamental group U*"° for an affine curve X° 
obtained by deleting some rational divisor from X. In this case, when we choose a 


collection a1, d2,...,a, of algebraic differential forms on X : inducing a basis of 
H} (X°), the coordinate ring of U“"° has the form 


Qp (aw), 


the Q, vector space generated by symbols a,,, one for each finite sequence w of 
numbers from {1,2,...,}. Furthermore, on X°(Z,), there is a lifting (depending 
on the previous choice of basis) 


dr,0 
US 


X°(Zp) ene yt? / Fo 


such that the restriction of a,, for w = (i, i2,...,i;) to X° (Zp) has the form 


Ay (Zz) =| Qj, Qj, *** Aj. 
b 
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Also, there is a functorial map 
US us 


compatible with the Hodge filtration so that the function w on ie / F° can be lifted 
to us, That is to say, one can construct a diagram 


dr,0 
US 


X°(Z,) — Us] F° 


enabling us to compute the restriction of a to X°(Z,) in terms of the a,,. The 
idea would be to carry this process out for two separate affine X° so as to cover 
X(Z,) and then to express a in terms of iterated integrals on each affine open set. 
Of course, the problem of explicitly computing the local liftings is also a daunting 
task, although possible in theory. The author makes no pretense of knowing, as yet, 
how to reduce this to a tractable process. Perhaps it is safer to state it also explicitly 
as a hypothesis: 


[H’]: The map 
gan us / F® 


can be computed. 


Choose a representative y € X(Q,) for each point in Y(F,) (= X mod p) anda 
coordinate z, centered at y. We must then approximate the zeros of a on X(Q,) by 
expressing it as a power series in the z,. This needs to be carried out to a sufficiently 
high degree of accuracy so that we can find an M and a finite collection y; € X(Q,) 
for which 


Iyilur= {x € XQp)lzy,@) S P™Y 


contains at most one zero of a. That is to say, we need to separate the zeros of a 
modulo p™. Note that even at this point, since all expressions will be approximate, 
there would be no way to determine which of the y; relate to actual points of X(Q), 
even though an upper bound for the number of points may be available, as was 
emphasized by Coleman [2]. In fact, the process of separating the points using 
small disks already seems to occur, at least implicitly, in the method of Coleman— 
Chabauty. In the next section we will see how to combine that separation with the 
section conjecture. 
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We summarize the preceding passages as follows: 


Observation 1 Assuming the Bloch—Kato conjecture and the hypotheses [H] and 
[H'], there is an effectively computable M such that the map 


M 
XQ XQ) X(Z/p”) 
is injective. 
In our view, this statement is one rather essential justification for studying the 
Selmer varieties and unipotent Albanese maps. That is, Faltings’ theorem as it stands 


does not seem to give, even in principle, a way of getting at this sort of effectivity. 
To belabor the obvious, the point is that the map 


X(Q)>X(Qp) 


is not a priori (i.e., before finding X(Q)) computable, even in principle, in contrast 
to the algebraic map 


Hy (1, O08) FP, 


When we embed X(Q) inside J(Q) using the base point b, we see then that we 
have an injection 


XQ I(Z/p™). 
But the kernel of the reduction map 
JQ J(Z/p") 


is of finite index, and hence contains NJ(Q) for some N. For example, one could 
take N = |J(Z/p™)|, which, in turn, can be computed from the formula 


|J(Z/p™)| = pe TEL, 


since p is a prime of good reduction. So finally, we arrive at an effectively 
computable N such that 


XQ IQ JQ/NIQ 
is injective. Let To be S together with the set of primes dividing N, and I’7, the 
fundamental group of Spec(Z[1/7o]) with base point given by Z[1/7T)]|—~Q—Q. 
Thus we get an injection 


XQeH' (Tn, JIN) 


that allows us to begin descent. 
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3 Non-abelian descent and its termination 


Once we have the final conclusion of the previous section, we can dispense entirely 
with the unipotent machinery and start to deal with the profinite formalism. There 
are many ways to construct a cofinal system for 


A := 7(X,b), 


of which we will use one described in a letter from Deligne to Thakur [5]. Let 
K, C A be the intersection of all open subgroups of index < n. It is a characteristic 
open subgroup, and hence we can form the finite quotient A(m) := A/K,. The 
order of this quotient has all prime divisors < n. Let T,, denote the fundamental 
group of Spec(Z[1/n!]). We also denote by (7) the quotient of 7,(X,b) by K,,a 
group that fits into the exact sequence 


0— A(n)>2(n)>T 0. 


For n larger than any prime in S, there is a pull-back diagram ([{12], proof of 
Theorem 2.8) 


0 > A(n) > (nN) > — 
iB 


0 - A(n) r (Vn) /Kn > In > 0 


where %, is a proper smooth model for X over Spec(Z[1/n!]). Therefore, we see 
that any point x € X(Q) defines a class in 


H'\(T,, A(n)) 


and that we have a commutative diagram 
X(Q) —*+ H'(,) 
() 


H'(T,, A()) @ A'(L, A(n)) 
There is a sequence of subsets containing X(Q), 


H\(r, A); c HT, A), 
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consisting of those classes whose projection to H'(I’, A(i)) lies in the image of 
H'(T;, AM) A(T, AW). 

Let no be larger than the primes in 7p. Then we have diagrams 


Hr, A); ——+ HP, A) 


Y 


AN(T;, AM) S ANT, AW) (2) 


Y 


ANT, JIN) — A(T; JIN) 
fori > no. Using this, we can define a decreasing sequence of subsets 
H' (Uy. J[N)n C A! JIN) 


for n > no consisting of those classes whose images in H'(T;, J[N]) lift to 
H'(T;, AQ@)) for all no < i <n. Forn > no, we also have a commutative diagram 


X(Q) —— H'(T,, A@)) 
(3) 
H'(Tn, JIN]) <> Ht (En, J[N)) 
Meanwhile, there is an increasing sequence 
X@n C XQ Cc A(T, JIN) 
consisting of the points with height (in some projective embedding) < n. We 


visualize the situation using the sort of filtration familiar from the arithmetic theory 
of elliptic curves: 


= X(Q)n Cc X(Q)n41 ee a H'(Tn, J[N]))m+1 
CHI, JINDm Co C An, JIN). (4) 
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Observation 2 The section conjecture implies that 
1 
X(Q)n =H (Th, JIN] m 


forn,m sufficiently large. At this point, X(Q) = X(Q)n. 


That is to say, we know when to stop searching. The simple proof is written out 
just to make sure the author is not confused. 


Proof. Assume the section conjecture. Then by diagrams (1) and (2), we have 
HiT, A); = H'(7, A) 
for all i and we actually have maps 
H'(T, A)>H'(T;, Al) 
for each 7. Therefore, 


HVT, A) = lim H'(D, A(i)) = lim H'(Ti, A(i)). 


Claim: Suppose c € H'(T,,J[N]) is not in X(Q). Thenc ¢ H' (Tx, J[N])m 
for some m. 


Proof of claim. If c € H'(Um,J[N])m for each m, then H'(T, A(m))-D H! (Mn, 
A(m))-, the classes in H'(I', A(m)) that liftc ¢ H'(P,, J[N])c A'(L, J[N]) 
are non-empty for each m. Thus, the inverse limit lim H'(T, A(m)).D lim H! 
(Tn, A(m))c, containing an inverse limit of non-empty finite sets, is itself non- 
empty. Therefore, c would be in the image of H'(T, A), and hence, in the image of 


X(Q). 


Thus, eventually, X(Q) = H!(ln,J[N])m. Of course eventually X(Q), = 
X(Q). Now suppose 


X(Q)n = H'(ln, JIN ])n 


at any point. Then classes not in H!(I'7,, J[N])m cannot lift to H'(T, A)m. And 
hence, they are not in X(Q). That is to say, X¥(Q), = X(Q). oO 

All the cohomology sets occurring in diagram (3) are finite and thereby have 
the nature of being computable through explicit Galois theory. Thus, the filtra- 
tion (4) can be computed in principle. As mentioned in the introduction, the 
actual implementation of such an algorithm is obviously an entirely different 
matter. 
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Hershy Kisilevsky 


Dedicated to the memory of S. Lang 


Abstract Let E/Q be an elliptic curve defined over the rational field Q. We 
examine the rank of the Mordell—Weil group E(K) as K ranges over cubic 
extensions of Q. 


Key words Elliptic curves ¢ cubic fields * Mordell—Weil groups 
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1 Introduction 


Let E/Q be an elliptic curve defined over the rational field Q. For any finite 
extension K/Q, the Mordell—Weil group E(K) of K-rational points of E is a 
finitely generated abelian group and we denote by rg(K) the Z-rank of E(K), i.e., 
the number of copies of Z in its standard decomposition as a direct sum of cyclic 
groups. In [F-K-K] we studied the behaviour of rg(K) as K ranged over cyclic 
cubic extensions of Q and showed that the condition rg(K) > rg (Q) is controlled 
by the rational points on a certain K3-surface defined over Q. For many curves E, 
we showed that rg(K) > rg(Q) for an infinite number of cyclic cubic extensions 
K/Q, and asked whether this was the case for every elliptic curve FE. Assuming 
the Birch & Swinnerton-Dyer conjecture, this question translates to the following: 
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If L(E,s) is the L-function of E/Q, then does the twisted L-series L(E, y,s) 
vanish at s = 1 for an infinity of Dirichlet characters y of order three? 

In the present paper we consider this question as K/Q ranges over all cubic 
extensions. 

We define the families of fields we will consider. Let k be a global field and 
let k be an algebraic closure of k. Suppose that F/k is a separable extension of 
degree | or 2. 


Definition 1.1. Define  (F’) to be the collection of all separable cubic extensions 
K/k (ink) whose Galois closures are M = K - F. 


We will consider all number fields to be subfields of the complex numbers C, so 
that we consider Q c C. 

In this paper, given a cubic extension K/k, we will always write M for the Galois 
closure of K/k, and F for the quadratic resolvent field (so F = k is permitted). 
In the case that k = Q, then either F = Qor F = Q(VD), where D ¥ 1 is 
a fundamental discriminant. We write ¥(F) = A(1) if F = Q, and A(F) = 
F(D) if F = Q(VD), with F/Q a quadratic extension. 

The family ¥(1) of cyclic cubic extensions K/Q was studied in [F-K-K] and 
the family .¥ (—3) has been considered by T. Dokchitser [DoT], where 


F(-3) ={K = Q(w'm?) | m € Z,m notacube, i = 0, +1} 


and where is a primitive cube root of unity. 
The results of this paper are contained in the following theorem. 


Theorem 1.1. Let E be an elliptic curve defined over Q, and let D = | or let 
D # 1 be a fundamental discriminant. Let EP denote the (quadratic) twist of E 
by D. Then rg(K) > re(Q) for an infinite number of K € F(D) if any of the 
following hold: 


(a) Both rg(Q) = landrgo(Q) = 1. 

(b) D > Oand rank z E7,(Ko) = 1 for some Ko € F(D). 

(c) D < Oandrankz Ey,(Ko) = 1 for some Ko € F(D). We must also assume 
that the density hypothesis holds (and in the case that E has CM, we assume 
also that Q(/D) is distinct from F' = End(E) ®z Q). 

(d) (T: Dokchitser) D = —3. 

(e) D <0, and E is a semistable elliptic curve of conductor Ng, prime to D, such 


that sign(E, Q(VD)) = —1 and L'(E/Q(VD), 1) £ 0. 


Here F7,(Ko) is the group of points of E defined over Ko and with trace zero to 
E(Q), sign(E, F) is the sign of the functional equation of the L-function of EF 
viewed as an elliptic curve over the field F, and the density hypothesis is defined in 
Section 3. 


Conjecture 1. For every elliptic curve E/Q and for every family .F(D), there is 
an infinite number of K € ¥(D) for which rg(K) > re(Q). 
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Question 1. Let k be a global field. Then for every elliptic curve E/k and for 
every family .F(F) as above, is there an infinite number of K € ¥(F) for which 
re(K) > re(k)? 


When [F' : Q] = 2, the corresponding analytic question concerns the vanishing 
at s = 1 of the twist L(E,p,s) of L(E,s) by the character of the (unique) 
irreducible two-dimensional Artin representation associated with M. The situation 
for these L-functions differs from the cyclic case treated in [F-K-K] because 
the signs of the functional equations of the L(E,p,s) are (to a large degree) 
independent of p in the families #(D), D # 1. 
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2 Cubic families 


Fix a fundamental discriminant D, i.e., either D = 1, or D is an integer which 
is either square-free and = 1 (mod 4) or 4 times a square-free integer = 2,3 
(mod 4), and let F = Q(/D). We consider the family ¥ (D) of all cubic extension 
fields K /Q whose Galois closure M contains F,i.e., M = K+ F = K(VD). 

Unless otherwise stated, we assume, for the remainder of the paper, that D is a 
fundamental discriminant and that D ¥ 1, i.e., we assume that F = Q(VD) isa 
quadratic extension of the rational field Q. 

Let M3(D) be the composite of all cyclic cubic extensions of F. Then 
M3(D)/Q is a Galois extension. Let G3(D) denote the Galois group Gal(M3(D)/F) 
and let t be an automorphism of M3(D) of order 2 which lifts the non-trivial 
element of Gal(F/Q). Then G3(D) splits under the natural action of t as a direct 
sum 

G3(D) = G3(D)* 6 G3(D)-. 


Here G3(D)* = {o € G3(D) | t(o) = t-o-t~! = o*}. If M3(D)* is the subfield 
of M3(D) fixed by G3(D)*, then M3(D)t is the composite with F of all cyclic 
cubic extensions of Q and is the maximal abelian (over Q) subfield of M3(D), and 
M23(D)~ is the composite of all S3-extensions of Q having F as quadratic subfield. 
It then follows that K € ¥(D) if and only if its Galois closure M is contained in 
M;(D)~. 

Suppose that K/Q is a finite extension and let Tr : E(K) —> E(Q) denote the 
trace map. We denote by E7,(K) the kernel of the trace map. 

We record the following lemmas. 


Lemma 2.1. let E be an elliptic curve defined over Q. The following are 
equivalent: 


(i) re(K) > reQ) 
(ii) E(K) contains a trace zero point of infinite order; 
(iii) #(Er,(K)) = 00. 


372 H. Kisilevsky 


Proof. Since E(K) is a finitely generated abelian group, it follows from the exact 
sequence 
0 — E7,(K) — E(K) — Tr(E(K)) — 0 


that rg(K) = rank z Tr(E(K)) + rank z E7,(K). But since [E(Q) : Tr(E(K))] < 
oo, it follows that rg(Q) = rank z Tr(E(K)), and hence that 


re(K) = re(Q) + rankz F7,(K). 


The statement of the lemma follows. Oo 


In light of Lemma 2.1, the conjecture is equivalent to the statement that for every 
elliptic curve E /Q, and for every fundamental discriminant D, #(E7,(K)) = oo for 
infinitely many K € F(D). 

Suppose now that E is an elliptic curve defined over Q, and that K /Q is a non- 
Galois cubic extension whose Galois closure is M. Let F be the subfield of M of 
degree 2 such that M = K- F. 


Lemma 2.2. The following are equivalent: 


(i) re(K) > re(Q); 
(ii) rg(M) > re(F). 


Proof. LetG = Gal(M/Q) = (1,0) ~ S3, where t?7 = o? = 1 andto = 
o'r. Let V be a Q-vector space which has an action by G ~ $3. Then there is a 
decomposition 

V=V°@V OV", 


where V°,V%, and V? are the subspaces of V corresponding to the trivial 
representation yo of G, the non-trivial abelian representation x of G, and the 
irreducible two-dimensional representation p of G, respectively. The subspaces 
which are pointwise fixed by the groups G, (a) and (rt) are V°, V° @ V%, and 
V° @ V? respectively. Applying this decomposition to V = E(M) @z Q, we see 
that 


re(K) > rg(Q)  dimg(V° @ V*) > dimg(V°) 
© dimg(V° @ V* @ V") > dimg(V" @ V*) 
© re(M)>re(F). Oo 


3 Associated surfaces 


Let a Weierstrass model for E'/Q be given by 
y2=x+Ax+B 


with A, B € Z. 
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Now suppose that K is an extension of Q of degree 3, and that P = (a, B) isa 
point in E(K). Since [K : Q] is odd, we see that 6B € Q(q@) and therefore Q(a) = 
Q(P) = Qor K. Ineither case, there exist a,b,c € Q with B = a+ ba+ca*. We 
note that the condition that c = 0 is equivalent to the statement that P lies on the 
intersection of E with the line L,, with equation y = a + bx. This is equivalent to 
the fact that a is a root of the cubic polynomial 


fap(x) =x? + Ax+ B-(a+ bx)’. 


The discriminant of f,, is a square in Q if and only if either a root of fu.y(x) 
generates a cyclic cubic extension of K/Q, or fu.5(x) has three rational roots. This 
is the case D = 1, i.e., K € F(1), and this was treated in [F-K-K]. 

Let S z =S§S 2 (d,a, b) be the affine surface defined by the equation 


D -d? = discriminant(x* + Ax + B — (a + bx)’) 
= —27a* — 4b7a? + (54B — 30Ab)a* + (36Bb? + 24A*b — 4Ab°)a 
+ A*b* — 4A? — 27B? + 4Bb® — 18A Bb”. 


Recall that for K € F(D), M = K(VD) denotes the Galois closure of K 
and t € Gal(M/Q) the non-trivial automorphism of order 2 of M fixing K. Also 
F = Q(VD) C M is the quadratic subfield of M and let o be an automorphism of 
order 3 which generates Gal(M/F), so Gal(M/Q) = (o,t | 1? =o? = 1) ~ $3, 
andre =o, 

Suppose that (d, a,b) is a Q-rational point on S?. If a is a root of fy.,(x), and 
Bp = a-+ba, then P = (a, 8) € E(Q(q@)). Since the discriminant of fi.,(x) is 
Dd’, it follows that either Q(a) = Q or F and f,,(x) has a rational root, and two 
conjugate roots in F, or Q(a) = K is an extension of Q of degree 3, with K € 
#(D) and P € Ey,(K), a point of trace zero. Thus (d,a,b) € se (Q) determines 
either a triple of points {P, 0, O°} with Q, Q* conjugate points in E(F) and P = 
—(0 + Q*) € E(Q), ora point P € Ey,(K) of trace zero. 

Conversely, suppose that K/Q is a cubic extension with K € #(D) and that 
P é€ E;,(K) is a point of trace zero. Then the points P, P®, and P® lie ona 
line L. Since the automorphism rt fixes P and interchanges P® and P°’, the set 
{P, P°, P*} is invariant under the action of Gal(M/Q), and therefore the line 
L = Lap is defined over Q. It follows that P = (a, 8) with 8 = a + ba and with 
a,b € Q. Similarly, if we have a triple of points {P, 0, O"} with QO, Q* conjugate 
points in E(F) and P = —(Q + Q*) ©€ E(Q), then this determines a rational line 
La» intersecting F in these three points. 

Therefore we have the following result: 


Proposition 3.1. The Q-rational points of SP are in one-to-one correspondence 
with rational lines intersecting E either ina triple of points {P, 0, Q*} with QO, O° 
conjugate points in E(F) and P = —-(0+ Q*) € E(Q), orina point P € Ey,(K) 
of trace zero on a cubic field K € ¥(D). 
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Let E? denote the quadratic twist of E by D. If E? is given by the model 
Dy? = x? + Ax + B, then the map ¢ : E?(Q) —> E(F) which takes a point 
QO = (x, y) to #(Q) = (x, VDy) is an injective homomorphism of groups such 
that t acts as —1 on the image #(E?(Q)) C E(F). In fact 6(E?(Q)) = {P € 
E(F)|P* = —P}. 

Let x : SP —+ A? be the projection onto affine 2-space given by 2((d,a,b)) = 
(a,b) € A*. We give conditions under which we can prove that 2(S 2 (Q)) is Zariski 
dense in A. 

Suppose first that D > 0. 


Proposition 3.2. Suppose that D > 0. Assume either that rank z Ey,(Ko) = 1 for 
some Ky € F(D) or that both rg(Q) = 1 and rgo(Q) = 1. Then the image of 
Q-rational points ae (Q) under x is Zariski dense in A’. (In fact, a(S? (Q)) is 
dense in the usual topology in an open subset of A?(R).) 


Proof. Suppose first that Ko € ¥(D) is a cubic field and that Po € Ey,(Ko) is a 
point of trace zero and of infinite order. Let Mo be the Galois closure of Ko. Since 
Tr(Po) = 0, it follows that P = Py and P’ = PY generate a subgroup of E(Mo) 
of rank 2 over Z which is stable under Gal(Mo/Q). Then for any n € Z, the line 
joining nP,nP’ also contains nP °° and so is defined over the rational field. In the 
case that both rz(Q) > 1 and rgn(Q) > 1, let Mp) = F = Q(VD). If Po and P, 
are points of infinite order on E(Q) and E? (Q) respectively, let P = Py + $(P)) 
and P’ = Po — $(P,) in E(Mo). Then P’ = P*, and P and P’ again generate a 
rank-two subgroup of E (Mo) which is stable under Gal(Mo/Q). In this case for any 
n € Z, the line joining nP,nP’ contains —2n Py € E(Q). Therefore it intersects E 
in two conjugate points of E(F) and a point in E(Q) and so is also defined over the 
rational field. 

The assumption that D > 0 ensures that Mp is a totally real field. 

Let Ag C C be the period lattice associated to EF and let 


C/Ag — E(C) 
z(mod Ag) +> P = (9(z), '(z)) 


be the analytic parametrization of EF. 
Let O(D) denote the interior of the set 2(S 4 (R)), 


O(D) = {(a,b) € A?(R) | discriminant( f,,,(x)) > 0}. 


Then O(D) consists of those (a,b) € A?(R) such that the line L,) intersects E 
in three distinct real points, and is a non-empty open subset of R?. Let O(D)t ¢ 
O(D) be the set of points corresponding to lines L, intersecting E(R) at three 
(distinct) real points P;, P2, and P3, on the connected component of the identity. 
Then O(D)* is open in O(D). 

Let z and z € R be real lifts of P and P’ with respect to the analytic 
parametrization, and let @; be a (non-zero) real period of E. Since P and P’ are 


Ranks of elliptic curves in cubic extensions 375 


independent over Z, it follows that z,z’, and w, are Q-linearly independent in R. 
Kronecker’s theorem then implies that given any x, y € Rand e > 0, there are 
integers m,n,,n2 € Z such that |x —mz— nj q@,| < € and |y — mz’ — n2a\| < € 
simultaneously (see Hardy & Wright Theorem 442, [H-W]). It follows that the 
multiples of (z,z’) (mod A%,) are dense in R*/(Zq,)*. Therefore given any two 
points Q;, Q> in the eonned component of the identity of F(R) there is an 
integer m € Z such that mP and mP’ are arbitrarily close to Q; and Qo, and 
so the line joining Q; and Q> can be approximated arbitrarily closely by the line 
joining mP and mP’ which is defined over Q by the above. It now follows that 
m(S z (Q)) is dense (in the Euclidean topology) in O(D)tand is therefore Zariski 
dense in A. oO 


Suppose now that D < 0. In this case the above proof fails because we cannot 
apply Kronecker’s theorem. We appeal to the following: 


Definition 3.1. (Density hypothesis.) Let E /Q be an elliptic curve defined over the 
rational field Q. Suppose that P is an algebraic point on E(C) such that no integer 
multiple of mP is real (mP = mP) or purely imaginary (i.e., mP = —mP), and if 
E has CM, then P is not the division of a real or a purely imaginary algebraic point 
by any (complex) endomorphism. Then the cyclic subgroup of E(C) generated by 
P is dense (in the usual topology) in the complex points E(C) of E. The density 
hypothesis can be restated more compactly as follows: Suppose P € E(Q) is such 
that A(P) ¢ E*+(C) for any 0 # A € End(£). Then the cyclic subgroup generated 
by P is dense in E(C). Here E*+(C) and E~(C) are the subgroups of E(C) on 
which complex conjugation acts by +1 and —1 respectively. 


In this regard, Waldschmidt had already introduced the “density property” for 
commutative algbraic groups in [Wal] and [Wa2]. He has shown for elliptic curves 
E defined over Q, which are not isogenous to their complex conjugates, that a 
subgroup of rank > 3 of E(Q) is dense in E(C). He has also pointed out that the 
density hypothesis above follows from the “elliptico-toric” conjecture of Cristiana 
Bertolin [Be]. 


Proposition 3.3. Suppose that D < 0. Assume either that rank z Ey;(Ko) => 1 for 
some Ky € ¥(D) and that the density hypothesis holds (and in the case that E has 
CM, we assume also that Q(/D) is distinct from F! = End(E) ®z Q), or that both 
re(Q) = Land rgo(Q) = 1. Then the image of Q-rational points SP (Q) under x 
is Zariski dense in A?. (In fact, w(SP (Q)) is dense in the usual topology in an open 
subset of A?(R).) 

Proof. Suppose that Po € Ey,(Ko) is a point of infinite order for some cubic field 
Ko € #(D). We may suppose that Po is a real point and that its conjugates Py 


and pe are complex conjugate points. Then we show that P = Pj satisfies the 
conditions of the density hypothesis. We note that the subgroup of E(Mo) generated 
by {Po, Py. Pe } is a group of Z-rank 2 with the relation Pp + Py + Pe =0.Ifa 
multiple of pe were real or purely imaginary, there would be a cena ideperiden 
relation of the form mPy = +mPy *. Since this would imply that the group had 
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Z-rank at most 1, it follows that Py) ¢ E*(M) for any 0 4 m € Z. Furthermore, 
if E has CM by the quadratic field F’ # F, let M’ = My- F’, so that 


Gal(M’/Q) ~ Gal(F’/Q) x Gal(Mo/Q) = (t’) x (z,0). 


Then t fixes Ko F’ and complex conjugation on M’ is given by tt’. Now Po € 
E(Ko) is a real point and its conjugates Py and Py * are complex conjugate points 
in E(M’). Suppose that A( Po) € E*(M’) for some endomorphism A € End(E) ~ 
Or. Then £A(PF) = A(Py) = Meo), so that applying o”, we see that A( Po) = 
+1 (P2). Since A( Po) + A(P2) + A(Po’) = 0 we find that 


(A4A)(P2) + A(P) = 0. 
Applying t and adding, we see that 
(2A + A)(PY + PZ) = 0. 


But since Py + Po = —P»y has infinite order, it follows that 2A + A = 0, and 
therefore 1 = 0. Hence the density hypothesis implies that the multiples of PY are 
dense in E(C). 

In the case that both rg(Q) > 1 and rgn(Q) = 1, then E(Q) is dense in the real 
points E and ¢(E?(Q)) is dense in the purely imaginary points of E and so their 
sum E(F) = E(Q) + d(E?(Q)) is dense in E(C). 

As before, let (a,b) = m((d,a,b)) € A?(R) for some (d,a,b) € 2(SP(Q)). 
If La» is the corresponding line, then L,., intersects E in one real point (P), say) 
and a pair of conjugate complex points (Q; and Q,). Then the density hypothesis 
assumed above implies that there is a point Q either in E(Mo) or in E(F) arbitrarily 
close to Q;, and so Q* is close to Q, and hence the line Lo joining Q, Q* and 
—(Q + Q°) is close to Lz». But since the set {0, 0',-(QO + Q*)} is stable 
under the action of Gal(Mo/Q) or Gal(F'/Q), the line Lo is defined over Q and 
so m(S 2 (Q)) is dense (in the Euclidean topology) in O(D) and is therefore Zariski 
dense in A’. Oo 


We can now prove Conjecture 1 under the assumptions of Propositions 3.2 
and 3.3. 

For fixed b € Q, the fibre over b, Xzy = Xxgy(d,a) is a curve of genus at 
most |. Our strategy is to try to find b € Q such that Xp is either an elliptic curve 
of positive rank or a rational curve with infinitely many Q-rational points. We then 
show that for almost all such 5, only a finite number of rational points on any Xz p 
can correspond to a given cubic extension K € Y(D), so in the event that X¢,(Q) 
is infinite, we obtain the conclusion of Conjecture | that there is an infinite number 
of fields K € F(D) for which rg(K) > re(Q). 

Recall that a Q-rational point Q = (d,a,b) € SP(Q) corresponds to the three 
roots a, @1,@2 (distinct if d 4 0) of the polynomial f,.,(x) = x7 + Ax + B— 
(a + bx)?. In addition there are two possibilities: either one of the roots is rational 
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and the other pair are conjugate elements in Q(/D), or Q(a), Q(a1), and Q(az) 
are conjugate cubic fields in .¥(D). 
The following results appear in Section 3 of [F-K-K]. 


Proposition 3.4. Fix a field K for which Faltings’ theorem holds. Then for any 
fixed b # 0, there are only finitely many a € Q such that the polynomial fap(x) = 
x? + Ax + B—(a+bx)* has three distinct roots in K. 


Proof. Fix b. Suppose that 
fap (x) = x2 + Ax + B—(a+ bx) = (x —a1)(x — a2)(x — 3) 
with a@),Q@2, anda3 € K. Hence a, a2, and a3 satisfy the following equations: 


xX, +x +23 = b?: 
XX. + X2xX3 + .x3x, = A —2ab; 


X1xX9x3 = a? — B. 


This is a system of three equations in the four variables x1, x2,.x3, and a which 
defines a curve in (say) x; and x2. A (Maple) calculation shows this to be a curve 
Cy’ = Cy'(x1, X2) of degree 4 in both x; and x2 for which an affine equation is 
given by 


Cy! 10 =— xf + (—2x2 + 2b”)x} + (—3x5 — b* —2A)x? 
+ (—2x2A + 2x2b* + 2b7A — 2x3) x1 
— A? —2x3A — x} + 2xob7A — x34 + 2x3b? + 4Bb?. 
For generic values of b, the curve C;’ has genus 3 . The set of b for which 


the genus can be less than three is finite and is a subset of the set of roots of the 
polynomial (in 5) 


=b'Q7BR* + 44°\(—274 + 186°A + 108 Bb" +5") 


(the ramification locus of C,’). The root b = 0 gives rise to Co’ : 0 = —(xj + 
X1X2 + x} + A)?. The roots of —27A” + 18b4A + 108Bb? + b® are distinct for 
all A and B with 27B? + 4A? # 0, and correspond to the slopes of the tangents 
to E at the non-trivial 3-torsion points of E and result in curves Cy’ of genus 2. It 
follows that for all b 4 0, C;’ has genus at least 2, and so by Faltings’ theorem 
there is only a finite number of K-rational points (x, x2) € Cy’(K). Therefore, for 
b # 0, the number of a such that f,,(x) has three (distinct if d 4 0) roots in K 
is finite. Oo 


Proposition 3.5. Fix bo € Q, bo 0, and let K/Q be a fixed finite extension. Then 
there is only a finite number of points Q = (d,a,bo) € SP (Q) for which the points 
in E(Q) corresponding to Q (by Proposition 3.1) are defined over K. 


Proof. This follows immediately from Proposition 3.4. 
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We now prove the following: 


Theorem 3.6. Assume either that rank z Ey,(Ko) > 1 for some Ko € ¥F(D) or 
that both rz(Q) = 1 andrgo(Q) = 1. If D < 0, assume also the density hypothesis 
in the first case (and in the case that E has CM, we assume also that F is distinct 
from F’ = End(E) @z Q). Then rank z E(K) > rank z E(Q) for infinitely many 
cubic fields K € F(D). 


Proof. Let y : SP —> A! denote the map taking Q = (d,a,b) € S? to ¥(Q) = 
b with fibres, the curves X¢,, generically of genus one. The desingularization 
g2 of S 2 is obtained by blowing up (twice) the nine rational double points of 
S 2 , corresponding to nine 3-torsion points of L. Let J be the associated Jacobian 
fibration with corresponding map w’ : J —> A!. Then for generic b € Q, w’—!(b) 
is an elliptic curve defined over Q which is isomorphic over Q to X¢4 if Xzp 
has a Q-rational point. There is a degree-four map defined over Q (see [F-K-K], 
Lemma 5.5) from f : oe —> J which is a dominant map. By Propositions 3.2 
and 3.3, SP (Q) is Zariski dense in $?, so FS? (Q)) is Zariski dense in J since f 
is dominant. By Mazur’s theorem [Ma] on the boundedness of the torsion over Q, 
the set of Q-rational points of J which are torsion points on the fibres is a Zariski 
closed subset of J. Since f Ge (Q)) is Zariski dense in J, there are infinitely many 
Qe Se (Q) such that f(Q) is a non-torsion point in infinitely many fibres in J. For 
such a point QO, y—!(Q) is isomorphic over Q to the elliptic curve w’~!(Q), which 
contains a Q-rational point of infinite order. Thus, y—!(Q) also contains infinitely 
many Q-rational points. It follows that for infinitely many b € Q, the curves X £5, 
have an infinite number of rational points. Proposition 3.4 then implies that there is 
an infinite number of fields K € F(D) for which rg(K) > re(Q). Oo 


Remark 3.1. In [F-K-K] we use an alternative argument proving the Zariski 
density of Propositions 3.2 and 3.3 and we get the same result without appealing 
to the density hypothesis. 


4 Rational three torsion 


If E has a 3-torsion point rational over Q, then F has a Weierstrass equation of the 
form E : y? + 3uxy +ty = x3 with u,v € Q (see Knapp, p. 146, [Kn]). In this 
case, the surface S$ e can be described by the affine equation 


Dd? = discriminant(x* — (a + bx)? — 3u(a + bx)x —t(a + bx)). 


Then se = ae (d,a, b) has a degenerate fiber over b = —3u which is the rational 
conic X¢,-3,(d,a), 


Da? = -27(a +t) (a? + 4ua + 4tu’), 
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and which becomes 
D2 + 3w* = 12u° — 12tw 


in the variables z = d/3(a +t) andw =a + 2u’. 
This conic has a rational point if and only if 3(u*—ur) is anorm from Q(/—3D). 
If (wo, Zo) is a solution to this norm equation, we obtain a parametrization given by 


a =a(r) = w—2W = (Dr?wo — 2Drz — 3wo — 2u° Dr? — 6u°)/(Dr? + 3) 


and 
d = —3(Dr?z + 6wor — 3z0)(a(r) + t)/ + Dr’). 


Substituting this value a = a(r) into the original elliptic curve gives the family 
of cubic equations 


(a + bx)? + 3ux(a + bx) + t(a + bx) = x°, 


or 
x3 + 3u(t + a)x —a(a +t) =0, 


whose root a = a(r) is the x-coordinate of the point P = (a,a + ba) € 
E(Q(a(r))). These expressions provide points of infinite order on the parametrized 
family of cubic fields K, = Q(a(r)) € F(D) for rational values of r. 

In the case D = —3 the norm condition is trivially satisfied and we obtain points 
of infinite order on a parametrized family of fields K, of the form K, = Q(m3), 


where 
2(r + 1)(r —1)° 
n= 


tr—t+2u3 
and where ; 
_=27=—1=um?) 
m3 
and 
4u3 — t(r — 1)* 
= ——$$—$—<$<$_@_ 3ux. 
r2—1 


Having such parametrized families allows us to give lower bounds for the number 
of K € ¥(D) (counted by increasing discriminant) over which E acquires points. 
See [F-K-K] §7 for such results for certain elliptic curves E/Q and for the family 
F (1). 


5 Theorems of T. Dokchitser and V. Dokchitser 


In this section we take note of some work of T. Dokchitser [DoT] and V. Dokchitser 
[DoV] which tends to support Conjecture 1. In fact, the result of T. Dokchitser 
provides a proof of Conjecture | for D = —3. 
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Let E be an elliptic curve defined over Q and suppose that L/Q is a Galois 
extension. Let p be an Artin representation of Gal(L/K) = G, and let p* denote 
the contragredient representation. V. Dokchitser [DoV] has computed the global root 
number of L(E,s, p) (the twist of the L-function L(E/Q,s) by p) when p ~ p* 
and no prime of additive reduction for E is bad for p. 

He shows (Corollary 2, [DoV]) that if p is the irreducible 2-dimensional Artin 
representation associated to K € F(D), D # 1, andif EF is an elliptic curve defined 
over Q whose conductor Ng is coprime to D, then 


; D 
WE = Sign(D) wel 
E 


where wW,,£ is the sign of the functional equation for L(E/Q, s, p) and where (:) is 
the Jacobi symbol. 

This implies that for an elliptic curve E'/Q of conductor Nz, if D 4 1 is coprime 

to Nz, there are infinitely many K € F(D) for which wp, - = —1, and so the Birch 
& Swinnerton-Dyer conjecture would then predict that for such fields K we have 
re(K) > reQ). 
In particular, he shows that for E = Ej, (the elliptic curve over Q of conductor 
19 labelled 19A in Cremona [Cr]), L(E/Q, 5, Pm) = 0 for every m prime to 19, 
where p,, is the irreducible 2-dimensional Artin representation associated to the 
field Q(m3). Since r¢,,(Q) = 0, the Birch & Swinnerton-Dyer conjecture predicts 
that rz,., (Q(m3)) > 1 for all integers m. 

T. Dokchitser [DoT] proves the following theorem: 


Theorem 5.1. (Dokchitser, T.) For any number field k and elliptic curve E defined 
over k there is an infinite number of cubic extensions K = k(a3)/k, a €k, such 
that re(K) = re(k). 


This provides a proof of Conjecture | for D = —3. 


6 Heegner points 


In this section we follow a suggestion of Darmon to use Heegner points to 
verify Conjecture | in certain cases. Fix an elliptic curve E/Q of conductor Ng 
and fix a negative fundamental discriminant D < 0 with D and N; relatively 
prime. Then F = Q(/D) is an imaginary quadratic field, and we denote by 
Or and ©, the maximal order of F and its suborder of conductor c,c € Z 
respectively. We use the notation of Darmon [Da], and let sign(Z, F’) be the sign 
of the functional equation satisfied by L(E/ F,s). Then we use the following ({[Da] 
Theorem 4.18): 
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Theorem 6.1. Let E be a semistable elliptic curve of conductor Ng, and F an 
imaginary quadratic field of discriminant D prime to Ng. If sign(E, F) = —1, 
then there is a non-trivial Heegner system { P,,} attached to (E, F). 


We will need the following lemma, which is proved in [K-S], Lemma 2.1. 

Let L be a number field, p a finite prime of L, J, the group of fractional ideals 
prime to p, Py, the group of principal fractional ideals in J,, Py the group of 
principal fractional ideals in P, generated by elements congruent to 1 mod p. Then 
Cy = Ip/Pp is the class group of L, Cr.» = Ip/ Pp, is the ray class group with 
conductor p, and P; = P,/P,, is the principal ray with conductor p. We have a 
short exact sequence 


1— Py — Cry — Ci — 1. (1) 


Let g be a prime, and consider the exact sequence of g-primary components 


bas ie 1. (2) 

We are interested in primes p for which the sequence (2) splits. Let aj,...,d5 € 

I, be such that their images q; in ce form a basis of the finite abelian g-group a 
Let g’"' be the order of a;,i = 1,...,s. Then qf = (a;) € Py, i = 1,...,5. 


Let Ly = L(Egm, “X/aj|1 < i < s) with Cj» a primitive q’’th root of unity, and 
m = max{1,m,,...,ms}. 


Lemma 6.2. (Splitting lemma) In order that the sequence (2) split, it is sufficient 
that p split completely in Ly. 


Let E be a curve satisfying the hypotheses of Theorem 6.1, and let {P,,} be the 
(non-trivial) Heegner system attached to (EF, F). 


Theorem 6.3. Let E be an elliptic curve of conductor Ng, and F an imaginary 
quadratic field of discriminant D prime to Ng. If there is a non-trivial Heegner 
system {P,,} attached to (E, F), and if L'(E/F,1) 4 0, then there is an infinite 
number of cubic fields K € ¥(D) for which rg(K) > re(Q). 


Proof. Let {P,,} be a non-trivial Heegner system attached to (FE, F'),so P, € E(H,) 
where H, is the ring class field over F of conductor n (n prime to Nz.) The Galois 
group G,, = Gal(H,,/F) is described by the exact sequence 


(OF / n Or )* 
(Z/nZ)*(Or)* 
where Cf is the ideal class group of the ring of integers Or of F. Taking g = 3 and 


L = F, the splitting lemma shows that there is set S of positive density of primes 
£ € Z satisfying the following properties: 


(3) 0— 


— G, — Cr — 0, 


£ = 1 (mod 3); 


the 3-primary part of the exact sequence (3) splits forn = £; 


£-Opr =(-( splits into principal ideals in Or. 
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In addition, since the set of primes p € Z for which ap(E) = 2 has density 0 
(Serre [Se]), we may also suppose that the primes £ € S have ag # 2. For such 
primes ¢, Hy contains a subfield M; which is a cyclic cubic extension of F, disjoint 
from the Hilbert class field H,, and Galois over Q with Gal(M;/Q) ~ S3. Then 
from the properties of Heegner systems (cf. Darmon, Proposition 3.10, [Da]) we 
have 

Traceq,/x,(Pe) = (ae — 01-0, ') Pi = (ae — 2) Pr, 


since [ is a principal ideal in O;. Therefore 
Tracey, /r (Pc) = (ae — 2) Tracey, /r(P1). 


If we set Qe = Tracey,/, (Pe), then Tracey,;r(Q¢) = Tracey,;r(P1). Since we 
assumed that L’(E/F, 1) # 0, the Gross—Zagier theorem implies that Trace y, ;- (P1) 
has infinite order. Hence Og € E(M,) is also a point of infinite order. Finally, if 
we had rg(M:) = re(F), then Gal(M;/ F) would act trivially on E(M_:) modulo 
torsion, and we would have that Q, is the 3-division of a point of E(F) modulo 
torsion. Since the torsion in a cubic extension of F is bounded, this would account 
for only a finite number of fields. But the M; are distinct for different primes £, so we 
see that there must be infinitely many fields My/F for which re(M_) > re(F). By 
Lemma 2.1, we conclude that there is an infinite number of cubic fields Ke € F(D) 
for which rg(K¢) > re(Q). Oo 


Corollary 6.4. Let E be a semistable elliptic curve of conductor Ng, and F an 
imaginary quadratic field of discriminant D prime to Ng. If sign(E, F) = —1, and 
L'(E/F,1) 4 0, then there is an infinite number of cubic fields K € #(D) for 
which re(K) > re(Q). 
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On effective equidistribution of expanding 
translates of certain orbits in the space of lattices 


D. Y. Kleinbock and G. A. Margulis 


In memory of Serge Lang 


Abstract We prove an effective version of a result obtained earlier by Kleinbock 
and Weiss [KW] on equidistribution of expanding translates of orbits of horospher- 
ical subgroups in the space of lattices. 


Key words homogeneous flows ¢ exponential mixing * equidistribution 


Mathematics Subject classification (2010): 37A17; 37A25 


1 Introduction 


The motivation for this work is a result obtained recently in [KW]. Fix m,n € N, 
setk = m+n and let 


Im Y 


G = SL;(R), T = SLx(Z), uy = & : 


i: H= {uy|Y € Munn}, (1.1) 


where M,,,, stands for the space of m x n matrices with real entries. Then H is a 
unipotent abelian subgroup of G which is expanding horospherical with respect to 


g,=diasle’™ 277, 6°" 2.8 120 (1.2) 
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The latter, by definition, means that the Lie algebra of H is the span of eigenspaces 
of Ad(g;), ¢ > 0, with eigenvalues bigger than 1 in absolute value. 


The space X ¢ G/T can be identified with the space of unimodular lattices in 
R*, on which G acts by left translations. Denote by z the natural projection G > 
X, g + gI, and for any z € X let 7, : G > X be defined by z,(g) = gz. Also 
denote by jz the G-invariant probability measure on X and by yz the Haar measure on 
G such that 24,2 = jt. Fix a Haar measure v on H. Note that the H -orbit foliation 
is unstable with respect to the action of g;,¢ > 0. It is well known that for any Borel 
probability measure v’ on H absolutely continuous with respect to v and for any 
z € X, g;-translates of (,),v’ become equidistributed, that is, weak-* converge 
to “ as t — oo. An effective version of this statement was obtained in [KM1, 
Proposition 2.4.8]. In order to state that result, it will be convenient to introduce 
the following notation: for f ¢ L'(H,v), a bounded continuous function yw on X, 
z€ X and g € G define 


intea= [ FAW (ghz) dv(h). 


In other words, I /y,(g, z) is the result of evaluation of the g-translate of (7,).v’ at W, 
where dv’ = f dv. Then equidistribution of g;-translates of (z,)4v’ amounts to the 
convergence of I /y,(g;,Z) to iy f: ty w as t — oo (unless it causes confusion, we 
will omit measures in the integration notation for the sake of brevity). 

The following is a slightly simplified form of [KM1, Proposition 2.4.8]: 


Theorem 1.1. There exists y > 0 such that for any f € CSp(H), W © CQinp(X) 


and for any compact subset L of X there exists a constant C = C(f,w, L) such 
that for all z € L andany t = 0 


ivtena- for fv 


The proof used the exponential decay of correlations of the G-action on X (called 
“condition (EM)” in [KM1]). See Section 2 for more detail. 

Motivated by some questions in simultaneous Diophantine approximation, the 
first named author and Barak Weiss considered translates of H-orbits on X by 
diagonal elements of G other than g;. Specifically, following [KW], let us denote 
by at the set of k-tuples t = (t),..., 4) € R* such that 


ace". (1.3) 


m 


n 
ti,...,t >0 and y= en 
j=l 


i=1 


and for t € a+ define 


def ,. _ _ 
g = diag (Cee 6" ae) eG (1.4) 
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and 


(the latter, roughly speaking, measures the distance between t and the walls of the 
cone at C R*). 
The theorem below is a reformulation of [KW, Theorem 2.2]. 


Theorem 1.2. For any f € L'(H,v), any continuous compactly supported W : 
X — R, any compact subset L of X and any ¢ > 0 there exists T > 0 such that 


iviea- for fv 


forallzé Landteat, |t|>T. 


=f 


That is, gt-translates of H -orbits become equidistributed as [t| — oo uniformly 
in z when the latter is restricted to compact subsets of X. The proof relies on 
S.G. Dani’s classification of measures invariant under horospherical subgroups and 
the so-called linearization method. The purpose of the present paper is to prove an 
effective version of the above theorem: 


Theorem 1.3. There exists y > 0 such that for any f € CSnp(H), W © CQinp(X) 


Cc 


and for any compact L C X there exists C= Cy w, L) such that for all z € L 


and allt € at 
ivtena- | tfv 
H Xx 


Note that the above statement follows from Theorem 1.1 when k = 2, that is, 
G = SL,(R), but is new for k > 2. The proof uses the “exponential mixing” 
approach of [KM1, KM3] together with effective nondivergence estimates obtained 
in [KM2]. We will describe these two parts Sections 2 and 3 respectively, and then 
proceed with the proof of Theorem 1.3 in Section 4. We remark that the method 
of proof readily extends to the set-up more general than (1.1). Note also that, as 
observed by N. Shah in [S, Remark 1.0.2], Theorem 1.3 can be used to strengthen 
one of the main results of [KW], that is, [KW, Theorem 1.4], which constitutes a 
diophantine application of Theorem 1.2. 


< Cell, 
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2 Exponential mixing and g,-translates 


Notation: We will fix a right-invariant metric dist on G, giving rise to the 
corresponding metric on X. B(x,7r) will stand for an open ball of radius r centered 
at x. If a metric space is G or its subgroups, we will abbreviate B(e,r) to 
B(r). When necessary, we will use subscripts denoting the ambient metric spaces. 
|| - lle and || - ||ce will stand for the (2, 2)-Sobolev and C! norms respectively. 
We define 


W*(X) = {fh € C™(X): |W lle < 00 VL € N}; 


clearly C&,,(X) C W(X). In fact, W*-°°(X) coincides with the set of functions 
w € C™(X) such that Dy € L?(X) for any D from the universal enveloping 
algebra of Lie(G). We let (-,-) stand for the inner product in L?(X). We also denote 
by ||¥|lLip the Lipschitz constant of a function y on X, 


ales 2 su Iv¥@) — vO 
. x,yEX, xAy dist(x, y) : 


‘ def 
and let Lip(X) = {wv : ||W|lLip < oo}. 


The following property of the G-action on X is deduced in [KM3] from the 
spectral gap on L?(X): 


Theorem 2.1 (KM3, Corollary 3.5). There exist y > 0 and £ € N such that for any 
two functions 9, W € W(X) and for any t > 0 one has 


teow - foo fv] <tolelvte ere. 


Here and hereafter the implicit constants in < depend only on the dimensions of 
the corresponding spaces and the choices of the metric. Taking g = g; as in (1.2), 
it follows that 


eww foo f | <lotalvte-e. (2.1) 


An estimate analogous to (2.1) was used in [KM1] to derive Theorem 1.1. In 
this section we apply Theorem 2.1 to prove a statement similar to Theorem 1.1, 
providing some information as to how C in (1.3) depends on f and L. The argument 
follows the lines of the proof in [KM1]; in fact, the statement below is basically 
an intermediate step in the proof of [KM1, Proposition 2.4.8]. However we have 
decided to include details for the sake of making this paper self-contained. 
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To pass from (9, W) to J py (gr, z), we need to thicken f into a suitable function 
gy on X. To explain this process, we need to introduce some more notation. Let 


HT = ie 1) € Merl 


and 


H® = & >) A €GL,,(R), B € GL,(R), det(A) det(B) = i 4 
The product map H~ x H° x H > G isa local diffeomorphism; we will choose ro 
so that the inverse of this map is well defined on Bg(ro). Note that H~ is expanding 
horospherical with respect to g_;,t > 0, while H° is centralized by {g,}. Thus, the 


inner automorphism ®,; of G given by ®,(g) = g,h(g;) | is non-expanding on the 
group 


A® A-H° = ie 3) 


~ (\Y B 
in fact, one has 
Vr>0Vt>0 ®, (By (r)) C By(r). (2.2) 


Let us choose Haar measures v~, v? on H~, H°® respectively, normalized so 
that jz is locally almost the product of v~, v° and v. By the latter, in view of 
[B, Ch. VII, §9, Proposition 13], we mean that jz can be expressed via v—, v° and v 
in the following way: for any g € L'(G) 


/ le) due) = iu g(h- hh) A(h®) dv (7) dv°(h°) dv(h), 
H-H°H H-xH9°xH 
(2.3) 


where A is the modular function of (the non-unimodular group) H. 
The “thickening” will be based on the following properties of the Sobolev norm, 
cf. [KM1, Lemma 2.4.7]: 


Lemma 2.2. (a) For any r > 0, there exists a nonnegative function 0 € 
Ca (R*) such that supp(@) is inside B(r), fgv@ = 1, and \6\le < 
pF N/2). 


(b) Given 0, € Coes 6, € CX (RY), define 9 € CX. (IR) by O(x) = 


com comp comp 
91(x)@2(x). Then ||9|le < ||: llell@allce- 
(c) Given 0; € Cre), 0) € Cth), define 0 € coe) by 


O( x1, X2) = O1(X1)62(x2). Then Olle < |: llell Palle. 
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We will apply the above lemma to functions supported on small enough balls 
centered at identity elements in G, H, H o AO. 


Theorem 2.3. Let f € CQy,)(H), 0 <r < 1ro/2 and z € X be such that 


(i) supp f C Br(r), and 
(ii) mz is injective on Bg(2r). 


Then for any v € W?°(X) 1 Lip(X) with Jy WV = 0 there exists E = E(y) such 
that for any t = 0 one has 


pte a) l= E(r i il +r OHH ye”), (2.4) 
H 


where y and £ are as in Theorem 2.1 and N = m* +mn +n? —1=dim H. 


Proof. Using Lemma 2.2, one can choose nonnegative functions 0~ € CQ,)(H_), 
0° € C2. (H”) with 


comp 
6 = e° =1 (25) 
H- H? 
such that 
supp(9) - supp(9°) C By(r), (2.6) 
and at the same time 
Olle << ret Ns), (2.7) 
where 6 € Con ) is defined by 


def 


6(h-h°) = 0-(h-)0°(h°) ACh)! 
Also define gy € C&,,(X) by g(h-h°hz) = 6(h-h°) f(h); the definition makes 
sense because of (2.6) and assumptions (i), (ii) of the theorem. Then J ¢y,(g;, z) can 
be reasonably well approximated by (g,9, v”) = (9, g-1): 
Zev (81.2) — (G8 -)| 


/ F(A) W(g:hx) dv(h) — i; 5 nh) fn gh Whx) dy(hh°h) 
H G 


(2.3) | 


| / 8 (n-YO°(H) f(A) (W(gehx) — We(@(A-A®)ghx)) ACH)! dh Wh) 
G 


(2.5) 


= sup wisx)— von! f |0~ (h-)0°(h°) F(AYACA®) "| duh hh) 
(2.2), (2.6) g€Bq(r). yeX G 


< [vir f i'l. 
(2.3) H 
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On the other hand, in view of Lemma 2.2 and z, being a local isometry, 

llolle = 18 - flle <A llell fle < OF le, 
hence by (2.1) 


Kare. WL KOM TF elle, 


finishing the proof. Oo 


Remark 2.4. In order to derive Theorem 1.1 from Theorem 2.3 it suffices to 
choose r = e~*' for some suitable 6. The same trick will help us in the proof 
of Theorem 1.3. Note that ¢ needs to be taken large enough so that condition (ii) of 
Theorem 2.3 is satisfied for all z ¢ L. The latter is possible because, in view of the 
compactness of L and discreteness of I’ in G, the value 


r(L) = inf sup{r > 0 | 2, : G — X is injective on B(r)} 
ze 


is positive; we will call it the injectivity radius of L. 


Remark 2.5. It is worthwhile to point out that H being the expanding horospheri- 
cal subgroup relative to g;, t > 0, was crucially important for the proof. When g; is 
replaced with g; where t is an arbitrary element of a*, one can still talk about ®,, 
the inner automorphism of H given by 


O(h) = geh(gi) (2.8) 


It is expanding on H, since the latter is contained in the expanding horospherical 
subgroup relative to g¢; however it is not non-expanding on H in the sense of (2.2); 
thus there is no guarantee that J ¢y(g;, z) is close to (gy, g-,W) for g constructed as 
in the above proof. We bypass this difficulty by means of an additional step, based 
on the nondivergence phenomenon, to be described in the next section. 


3 Quantitative nondivergence 


For any ¢ > 0 consider 
Ke 2x({geG|llgvl>e VveZ*~ {o}}). 


In other words, K, consists of lattices in R* with no nonzero vector of length less 
than e. These sets are compact by virtue of Mahler’s Compactness Criterion (see 
[R, Corollary 10.9] or [BM]). Here || - || can be any norm on R* which we will from 
now on take to be the standard Euclidean norm. 
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It was proved in [KM2], refining previous work on nondivergence of unipotent 
flows [M, D], that certain polynomial maps from balls in Euclidean spaces to X 
cannot take values outside of K, on a set of big measure. Namely, the following 
is a special case of [BKM, Theorem 6.2] (see also [KLW, KT, K] for further 
generalizations): 


Theorem 3.1. Ford €N, let y be amap R4 — GL,(R) such that 


(i) all coordinates (matrix elements) of p(-) are affine (degree | polynomials), and 
let a ball B C R4 and 0 < p < 1 be such that 
(ii) for any j = 1,...,k — 1 andany we /\! (Z*) ~ {0} one has 


lo(x)w|| => pe forsome x € B. 


Then for any positive € < p one has 


2 \ '/ak-D 
A ({x € B | 2(y(x)) ¢ Ke}) « (=) A(B). (3.1) 


Here A is Lebesgue measure on R¢, and the Euclidean! norm || - || is naturally 
extended from R* to its exterior powers. We remark that the way assumption (i) 
is used in the proof is by verifying that all the functions x +> ||g(x)w||, where 
we /\/(Z*), are (C,a)-good on R“, with some fixed C = C(d,k) > 0 and 
a = 1/d(k — 1), the exponent appearing in (3.1). See [KM2] for more detail. 

Our plan is to apply Theorem 3.1 with g : M,,, — G given by 


Q(Y) = gtuyg (3.2) 


for some g € G andt € a’. It is clear that assumption (i) holds. As for (ii), 
we will need to have uniformity in t € at and in g such that z(g) belongs 
to a compact subset of X. This can be extracted from the next lemma, which 
is immediate from [KW, Proposition 2.4] applied to the representations of G on 
AJ (RB), fj =1,...,k-1: 


Lemma 3.2. There exists a > 0 with the following property. Let B be a ball 
centered at 0 in Mm». Then one can find b > 0 such that for any j =1,...,k —1, 
any w € /\/ (R*) and any t € at one has 


sup || guy wl] > be! Iw]. 
YeB 


In [KM2] the statement of Theorem 5.2 involved the sup norm instead of the Euclidean one, which 
resulted in a restriction for p to be not greater than 1/k; thus we chose to refer to [BKM] for the 
Euclidean norm version. 
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Corollary 3.3. Let B be a neighborhood of 0 in Mm» and let L C X be compact. 
Then there exists b > 0 such that for any j =1,...,k —1, anywe /\/ (Z‘) < {0}, 
any g € m~'(L) andany t € a* one has 


sup || guy gw| > belt! 
YeB 


Proof. Apply the above lemma with w replaced by gw; it follows from the 
compactness of L and discreteness of /\/ (Z*) in /\/ (R*) that 


inf {\|gwi| | x(g) € L, we Ni (2) ~ 0}, 7 = 1... 1\ 


is positive. Oo 
Corollary 3.4. Let L Cc X be compactand let B C H bea ball centeredate C H. 


Then there exists T = T(B, L) such that for every 0 < € < 1, any z € L and any 
teat with |t| > T one has 


v({heB| ghz ¢ K.\) « emt p(B). 


Proof. Define T by be*’ = 1, where a is given by Lemma 3.2 and b by 
Corollary 3.3 applied to log(B) C M,,, and L. (Note that the exponential map 
from M,,, to H is an isometry.) Take @ as in (3.2) with g € x! (L). Then, in view 
of Corollary 3.3, assumption (ii) of Theorem 3.1, with d = mn, will be satisfied 
with p = 1 as longas |t| > T. Oo 


We conclude this section by an estimate of the injectivity radius of K,, to make it 
possible to combine the above corollary with Theorem 2.3. Observe that any lattice 
A € K, can be generated by vectors of norm < 1/e*—!; if gA = A and g # e, then 
for one of those vectors v one has ||gv — v|| > ¢. This implies that dist(e, g) >> e*. 
We atrive at 


Proposition 3.5. There exists positive c = c(k) such that r(K;) > c-e* Ve > 0. 


4 Proof of Theorem 1.3 


Our goal in this section will be to find y > 0 such that for any f € CSi,(7), 


ve W(X) Lip(X) with fy v = 0 and compact L C X there exists C > 0 
such that for all z € L and all t € a* one has 


[Zev (ge. 2)| < Ce Ve, (4.1) 


Then Theorem 1.3 will follow by applying (4.1) with w replaced by wy — /, xy V- 
Note also that, by increasing C if needed, it is enough to prove (4.1) for t with large 
enough |t]. 
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Given t € at, define r © |t|/2, and let 


u=ut) #t-(<... : meee ty, (4.2) 


’ >> 
m mn n 


Note that u € at, |u| > |[t|/2 = f, and gy = g;2u (here gy and gy are defined 
via (1.4), and g; is as in (1.2)). 

Take a function 6 supported on By (r) as in Lemma 2.2(a), with r = e~*' where 
B is to be specified later; since f, H 6 = 1 and v is translation-invariant, one can write 


Leo(aua) =f fOyw(edadvity f(y) avy) 
~ I, f(y") O(V)V (gray (hz) dv(y) dv(h) 


=| [ Feormeor(ereahe) dv) dv. 
Note that | lisa contracting automorphism of #7, in fact, one has 
dist (e, ®;'(h)) < el4l dist(e,h) < e~?! dist(e, h) 


for any h € H. Choose B = B(r) containing supp f. Then the supports of all 
functions of the form h +> f (®7!(y)h) are contained in 


Be B(r +e OFP)"), 
By taking ¢ large enough it is safe to assume that 
e Ft < 79/2, (4.3) 


v(B) < 2v(B), and also that t > T = T(B, L) as in Corollary 3.4. Now define ¢ 


by 
2 1/k 
Pe (Ze) ; (4.4) 
Cc 


where c is from Proposition 3.5, and denote 


A {he B | guhz¢ K.}. 


Then for any u € at with |u| > 7 and any z € L one knows, in view of 
Corollary 3.4, that 


v(A) & emt p(B). 
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Hence the absolute value of 


[I Ff (®y'(0)A)O() W (gi: yguhz) dv(y) dv(h) 
AJH 


FE B 
« et v(B) sup f| sup HW] f 6 <sup|f|sup|y|v(B)-e me0"” 
H 


Now let us assume that h € B ~ A, and apply Theorem 2.3 with r = e~P", guhz 
in place of z and 


def 


fily) = f (®y'O)A)OO) 


in place of f. Clearly condition (i) follows from (4.3), and, since guhz € Ke 
whenever h ¢ A, condition (ii) is satisfied in view of Proposition 3.5 and (4.4). 
Also, because ®7'|; is contracting, partial derivatives of f will not increase after 
precomposition with ®| ' and thus 


Il fale a IF llcell@lle ‘ne rm fice. (4.5) 


< 
emma 2.2(b) emma 2.2(a) 


This way one gets 


if ; i F (2! (yh)OC)¥(grvguh2) aviv) dvtn) 
< [ IT jw (grs 8uhz)| dv(h) 
BNA 
< EW) (« / tie rr OerHr fee”) »(B) 
(2.4) H 


« E(w) (sup| fe + If Ice eC N/A) y(B). 
(4.5) 


Combining the two estimates above, one can conclude that 


oa 
Lew (ge. 2)| K Cre RED! + Coe + Cye OOO N/E 


< max(C1, Coe HED! 4. Cye“P-CEFN/DB, 
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where C;,i = 1,2,3, depend on f, yw and L. An elementary computation shows 
that choosing 6 equalizing the two exponents above will produce 


7 y 
~ 14+ mnk(k — 1)(20 + N/2) 


| 


such that (4.1) will hold with Cx max(C}, C2, C3). oO 
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1 Introduction 


1.1 Background and main results 


The theory of Eisenstein series plays a prominent role in the theory of automorphic 
functions and automorphic forms. Classically, in the theory of holomorphic modular 
forms, the Eisenstein series of weight 2k (k € N, k > 2) for the full modular group 
PSL 2(Z) are defined by 


1 1 

€>(z) = = ee = iyveC, y>O0). 

2k (Z) 5 | a, (cz + dy (Z=x+iy y >0) 
(c.,d)=1 


The arithmetic significance of these series is reflected by the fact that their Fourier 
coefficients are given by certain divisor sums. 

More generally, in the theory of automorphic functions for Fuchsian subgroups 
T of the first kind of PSL2(R), Eisenstein series are defined by 


E(z,.8) = > Im(yz)° (s € C, Re(s) > 1); 


yelToo\I 


here 4g denotes the stabilizer of the cusp joo in the group T’. For s € C with 
Re(s) > 1, the Eisenstein series €(z,s) are C°°-functions in x, y. For z € C with 
Im(z) > 0, the series E€(z, s) are holomorphic functions in s as long as Re(s) > 1. 
It can be shown that the Eisenstein series €(z, s) admit a meromorphic continuation 
to the whole s-plane. The significance of €(z, 5) relies on the fact that these series 
are eigenfunctions of the hyperbolic Laplacian Ajyp for the continuous spectrum. 
The classical approach to establishing the meromorphic continuation is based on 
the explicit knowledge of the Fourier expansion of €(z, s). Other approaches rely on 
the meromorphic continuation of the resolvent kernel of Anyp or Colin de Verdiére’s 
method given in [3]. 

Observing that the series E(z,s) are associated to the cusp ioo, S. Kudla 
and J. Millson introduced in [11] so-called hyperbolic Eisenstein series Epyp(z, s}) 
associated to geodesics in the upper half-plane H, and proved a partial meromorphic 
continuation and a Kronecker limit-type formula for these series. Following this 
point of view, J. Jorgenson and the first author were led to consider so-called elliptic 
Eisenstein series E.(z, 5) associated to elliptic fixed points z) € H for I’. In fact, 
these series were introduced in [9] (see also the unpublished paper [8]) in order to 
derive optimal sup-norm bounds for cusp forms of weight 2 for the subgroup [. 
An alternative, more elementary proof for these sup-norm bounds avoiding elliptic 
Eisenstein series is given in [7]. 

The elliptic Eisenstein series associated to an elliptic fixed point z) € H for the 
subgroup [ is defined by 
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Eales) = SY) sinh(e(o;,'y2))" #0). 


vey \P 


where I,, denotes the stabilizer of zp in’, o,, € PSL2(R) is a scaling matrix for zo, 
Le., 0,,(7) = Zo, and @(z) denotes the hyperbolic distance from z to 7. In the Ph.D. 
thesis [13] by the second named author, the meromorphic continuation of the elliptic 
Eisenstein series €.(z, 5) for any Fuchsian subgroup I of the first kind to the whole 
s-plane is proven using a variation of Colin de Verdiére’s method mentioned above. 
Moreover, various expansions of the series €.(z, 5) are computed and a Kronecker 
limit type formula is established there. 

In this paper we study elliptic Eisenstein series in the special case T = PSL2(Z) 
and z = i. Following the classical approach, the main goal of this paper is 
to establish the meromorphic continuation of the series €.1(z,s) by means of its 
Fourier expansion thereby complementing work carried out in [13] in the special 
case T = PSL»(Z). In order to achieve our goal, the Fourier expansion of E.(z, 5) 
has to be explicitly computed and the growth of the Fourier coefficients has to be 
controlled. 


1.2 Outline of the paper 


The paper is organized as follows. In Section 2, we recall and summarize basic 
notation and definitions used in this article. 

In Section 3, we recall the classical Poincaré series P,,(z,s) and relate them to 
the more recent Poincaré-type series V,,,(z, s) studied in [14]. We review how the 
meromorphic continuation of P,,(z,5) can be obtained via its spectral expansion. 
Via the aforementioned relation, we obtain the meromorphic continuation of 
Vin(z, 8) to the whole s-plane. 

In Section 4, we define the elliptic Eisenstein series €.(z, 5) associated to the 
elliptic fixed point i of PSL2(Z). We show that it is holomorphic for Re(s) > 1 
and an automorphic function for PSL2(Z). In contrast to the parabolic situation, the 
elliptic Eisenstein series fails to be an eigenfunction of Anyp; instead it satisfies the 
differential equation 


(Anyp — 5(1 — 5) Een(z, 8) = —87Een(z, 8 + 2). 


In Section 5, we calculate the Fourier coefficients of E.1(z, 5). In order to simplify 
the exposition, we restrict our study to the case z € H with Im(z) > 1. 

In Section 6, we obtain the meromorphic continuation of E.y(z, 5) via its Fourier 
expansion. The main task here is to first meromorphically continue the mth Fourier 
coefficients d,(y, 5) of E.y(z,s) and then to achieve suitable bounds for a,,(y, 5) 
with respect to m. The main result is stated in Theorem 6.10. 
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2 Preliminaries 


2.1 Basic notation 


Let T := PSL2(Z) be the modular group acting by fractional linear transformations 
on the upper half-plane H := {z = x +iy € C|y > O}, ie. fory = (2 e) el. 
and z € H, we have 


az+b 
cz+d’ 


Yoi= 


We denote by Fr a fundamental domain of Tin H. By ’, := Stabr(z) we denote 
the stabilizer of z € H in IT, and we set 


As usual, we put e(z) := exp(2ziz) and denote by ¢(s) the Riemann zeta 
function. 

In the rectangular coordinates x,y, the hyperbolic line element Sys the 
hyperbolic volume element jlpyp, and the hyperbolic Laplacian Any, on HI are given 
by 


a a? 
ds}, => Khyp = Anyp = -y? =< oF ae Mls 
'yP y? 0x2 dy? 


We recall that the hyperbolic volume volnyp)(Fr) of Fr is given by 
a 
VOlnyp(Fr) = | Lnyp(Z) = 3 . 
Fr 


By dhyp(z, w) we denote the hyperbolic distance from z € H to w € H. 
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2.2. Hyperbolic polar coordinates 


For z = x +iy € H, we define the hyperbolic polar coordinates @ = 0(z), ¥ = 0(z) 
centered ati € H by 


Q(z) = hyp (i,z),  O(z) = ACL, T), 


where L denotes the positive y-axis and T, is the tangent at the unique geodesic 
passing through i and z at the point 7. 


The relation between the x, y-coordinates and the g, }-coordinates is expressed 
through the formulas 


_ sinh(Q) sin(v) _ 1 1 
_— cosh(g) + sinh(g) cos(#)’ = cosh(g) + sinh(g) cos(#)” (1) 


Using the above formulas, the hyperbolic line element and the hyperbolic Laplacian 
in terms of the hyperbolic polar coordinates take the form 


0 1 0 1 0? 
dg? tanh(@) de _—sinh?(g) 00? 


dss = sinh?(9)d9? + dg’, Anyp = — 


From the well-known formula for the hyperbolic distance (see [Bea91], p. 131) 


cosh(dhyp(z, w)) = 1+ seen 
nee 2Im(z)Im(w)’ 


we obtain for z = x + iy € Handy = (44) eT, 


1 
cosh(@(yz)) a cosh(dhyp(z, y~'i)) = ay (2 + (a? + c*)|\z— ae | ): 
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A straightforward computation yields 


cosh(@(yz)) = le + 7)(x? + y?) + 2(ab + cd)x + (b? + d’)). (2) 


2.3 Hypergeometric functions 


For a,b,c € C,c #4 —n (n € N), and w € C, we denote Gauss’s hypergeometric 
function by F(a, b; c; w). For w € C with |w| < 1 it is defined by the series 


F(a,b;c;w) := yO Wt, 
k=0 . 


where (A), := TA + k)/TA) A € C, &k € N) is the Pochhammer symbol; for 
k € N with k > O, we note the alternative formula (A), = ieage + j). For 
Re(c) > Re(b) > 0, the hypergeometric function F(a, b;c;w) has the integral 
representation (see [1], formula 15.3.1) 


1 
F(a,b:c;w) = ae 5 f ta -ara-y dt. (3) 
0 


P(e 


2.4 Parabolic Eisenstein series 


For z € Hands € C, the parabolic Eisenstein series Epa is given by 


Epar(Z, S) = a Im(yz)’. 


y€Too\P 


The parabolic Eisenstein series is known to be holomorphic for s € C with Re(s) > 
1 with Fourier expansion given by 


Epar(z8) =» + 9(s)y'* + D> g(a, sy"? KsipQn|nly)enx), — (4) 
n¥%0 


where K;_,/2(-) is the modified Bessel function of the second kind, 


Ja (s—1/2) €(2s—1) | AQ@s—1) 
T(s) “EQs)A(Qs) 


p(s) = 
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and 


_ 2n5|n|s—1/2 


a oe 2 a s—1/2 
90.9 = Toray Le = Fay LG) 


d\n ab=|n| 


here we set A(s) := a~°/?I(s/2)¢(s). The Fourier expansion (4) provides the 
meromorphic continuation of €par(z, s) to the whole s-plane with a simple pole 
at s = | with residue ress—1€par(z,5) = 1/Volnyp(Fr) = 3/2, and other poles 
contributed by the non-trivial zeros of ¢(2s) in the strip 0 < Re(s) < 1/2. From 
the functional equation A(s) = A(1 —s), we get y(s)g(1 — s) = 1, and hence the 
relation 


2A(2s—1 bys-1/2 
pls)p(n, 1s) Se | 


~ A(Qs)A(—2s + 2) 


ab=|n| 
2 a\s—1/2 
= Kany 2 (6) = g(n,s), (5) 


which, using (4), proves the functional equation 


Epar(Z, s)= p(s) Epar(Z, 1— s). (6) 


3 Poincaré series 


In this section we recall results for two types of Poincaré series that are mostly 
known to the experts. However, for lack of complete reference, some proofs have to 
be elaborated. 


Definition 3.1. For z ¢ H, s € C, and m € Z, the Poincaré series P,, is defined by 


Pm (Zz, 8) = > Im(yz)* exp(—2zr|m|Im(yz))e(mRe(yz)). 
y€Too\P 


The Poincaré series is known to be holomorphic for s € C with Re(s) > 1, since it 
can be majorized by Po(z, Re(s)) = Epar(z, Re(s)). 


Remark 3.2. For m # 0, the Poincaré series P,,(z, s) is bounded on Hi (see [10], 
p. 83) and hence admits a spectral expansion in terms of the eigenfunctions yw; 
associated to the discrete eigenvalues A; of Anyp and the parabolic Eisenstein series 
Epars namely 


loo) 1 oO 
Pin (ZS) = dain (s) Wj(2) + x/ 1/24 ir (S) Epar(Z, 1/2 + ir)dr, (7) 
= a 
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where the coefficients aj m(S), resp. @1/2+irm(S), are given by 
aja (s) = / Pin (Zz, sj (Z) Mhyp(z). resp. 
Fr 


1 /2+irzm (s) = I Pin (Z, NE gala, 1/2 ar ir) Lhyp (2). 
FT 


The expansion (7) is absolutely and locally uniformly convergent for s € C with 
Re(s) > 1. 

As ususal, we enumerate the eigenvalues of the discrete spectrum by 0 = Ao < 
A, < Ag <...; since TF = PSL2(Z), we have A; = 1/44 tj = sj; —S;), Le., 
s; =1/2+it; witht; > 0,aslongas j > 0. For j = 0, the eigenfunction is given 
by Wo(z) = V3/z. For j > 0, the eigenfunction y; is a cusp form and admits a 
Fourier expansion of the form 


Wj) = — py(n)y'? Ks,-12(20|n|y)e(nx). (8) 
nA#0 


The eigenvalues of the continuous spectrum are of the form A = 1/4+r? = s(1-s), 
ie., s = 1/2 + ir withr € R. The corresponding eigenfunctions are given by the 
parabolic Eisenstein series Epar(z, 1/2 + ir). 


Proposition 3.3. For z € H, s € C with Re(s) > 1, andm # 0, the Poincaré series 
Pn (Z, 8) has the following explicit spectral expansion: 


Ce 
Pn (z, 8) 27°! aT(s)|m|5-V? = > Ps —s))P(s + 5; — 1p) (mM Wj @ 
j=l 
1 CO 
+ all Ms — 1/2— inl (s — 1/2 + iry@(m, 1/2 + ir) Epar(z, 1/2 + ir) dr. 
TT J—oo 
(9) 
Proof. The proof can easily be deduced from the spectral expansion given for the 
function P »(z,s) = m°—/7T(s + 1/2) |m|*-"/? Pa(z, 5) in [12], p. 58. o 


Proposition 3.4. For z € H and m F 0, the Poincaré series Py(z, 5) admits a 
meromorphic continuation to the whole s-plane with simple poles ats = s; — N 
and s = —s; -N + 1(N €N) with residues 


TSs=5;—N Py (z, Ss) 


_ (-1)%2 25 j F2N +1 5 sjtN- 'TQs; _ 


N-—1) _ 
_ NIT (s; — N)|m|si-N-1/2 Dd Beln) vel2) 


Se= Sj 
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and 
TeSs=—s,—n+1 Pm(z, 5) 
_1)N 928; +2N-l sj t+NT(_96, — 
ax za Tons TRIES ~ XE Pulm) 
respectively. 


Proof. Due to the lack of reference for the claimed residues, we have to discuss 

the proof briefly. In order to obtain the desired meromorphic continuation we will 

follow closely [12] and [10], and base our argument on the spectral expansion (9). 
We start by discussing the meromorphic continuation of the discrete part 


D(s) := DOT — 5,1 (s + 5; — p,m) ¥@ 


j=l 


of the spectral expansion (9). The argument given in [10], p. 87, shows that D(s) has 
a meromorphic continuation to the whole s-plane with simple poles at s = s; — N 
ands = —s; -N + 1(N €N) arising from the I’-factors. For later purposes, we 
note the bound (see [10], p. 87, adapted to the present situation) 


|D(s)| « y*?, (10) 


where the implied constant depends only on s (not a pole), but is independent of 
z and m. The dependence of the implied constant on s is uniform as long as s is 
contained in a compact set not containing s; — N or —s; — N + 1 forsome N €N. 
For the residues we compute 


(-1)" _ 
1855, D(s) = 4-128; — N 1) YD) Bel) vel2) 
Se=S;j 
and 
(-1)" = 
TeSy=-s,-v+1D(s) = 4-125; — N +1) D7 Bel) We), 
Se=Sj 
respectively. 


We now turn to the meromorphic continuation of the continuous part 


O(s) = ~ [ T(s — 1/2 —inT'(s — 1/2 + im, 1/2 + ir) Evar(z, 1/2 + in dr 
(11) 
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of the spectral expansion (9). By substituting t := 1/2 + ir, the integral (11) can be 
rewritten as 


l 1/2+i00 


(== T(s—#0(s—1+4 Ae(m,1—2) Epar(z.t) dt. (12) 
Ami J1/2-i00 


By construction, the integral (12) exists for s € C with Re(s) > | and represents 
a holomorphic function in this range. The argument given in [10], p. 89, shows that 
Q(s) extends to a holomorphic function for s € C with Re(s) 4 —N + 1/2, and 
for s = —N + 1/2, where N € N. In order to extend Q(s) to the whole s-plane, we 
rewrite the integral (12) by means of a different path of integration (see [12], p. 51) 
using the residue theorem as follows. Let so € C with Re(so) = —N + 1/2 for some 
N € Nand Im(s9) > 0, and let C(so) denote the integration path, which runs on the 
vertical line with Re(t) = 1/2 from —oo to oo as before, but passes on the left-hand 
side around —s9 — N + | and on the right-hand side around sp + N in such a way 
that the only poles of the integrand being encircled by this new integration path are 
located at t = —so — N + 1 andt = so + N. Fors with Re(s) > —N + 1/2 being 
sufficiently close to so such that —s — N + 1 and s + WN are still encircled by the 
path C(so), we set 


O(s) = a Ts — 2) (s — 1 + t)e(m, 1 — t) Epar(z, t) dt, 
4ni C(s0) 


which is well defined by construction. Using the residue theorem and recalling (6) 
and (5), we then compute 


a= | P(s— ft) (s — 1+ t)g¢(m, 1 — t) Epar(z, t) at 
Ari C(s0) 


(-1)% 
ON! 


1? 
2N! 


T(2s + N — 1)g(m,—s — N + 1) Eparlz,s + N) 


ip Ts +N —1)o(m,s + N) Epar(z,-s — N + 1) = O(s). 


For s with Re(s) < —N + 1/2 being sufficiently close to so, we define Os) as 
above and verify again O(s) = Q(s), now using Cauchy’s theorem. By the choice 
of the integration path C(so) it turns out that the integral 


CG = / Fo = AT Gy =1 £000, 1— 1) Eacltdt 
4ri C(so) 


is also well defined. Proceeding in an analogous way for so € C with Re(so) = 
—N + 1/2 forsome N €N, but Im(so) < 0, we obtain the analytic continuation of 
Q(s) to the whole s-plane. 
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All in all, these considerations show that P,,(z,s) admits a meromorphic 
continuation to the whole s-plane with simple poles at s = s; — N and 
s = —s; —-N+1(N € N). The stated formulas for the residues are easily 
obtained from the residue computations for D(s), taking into account that the factor 
2-541 -8t+ 1D (5)! |m|-8+!/2 does not contribute further poles. 

Before finishing the proof, we recall for later purposes that for s € C, we have 
the bound (see [10], p. 90) 


|Q(s)| « y'”, (13) 
where the implied constant depends only on s (not a pole), but is independent of z 
and m. oO 


Definition 3.5. For z ¢ H, s ¢ C, and m € Z, the Poincaré series V,, is defined by 


Vin(Z, 8) 1= > Im(yz)*e(mRe(yz)). (14) 
v€Too\P 


The Poincaré series is known to be holomorphic for s € C with Re(s) > 1, since it 
can be majorized by Vo(z, Re(s)) = Epar(z, Re(s)). 


Lemma 3.6. For z € H, s € C with Re(s) > 1, andm # 0, we have the relation 


[o.e} 2 k 
Vin (Z, 5) = y Cr Pals +). 
k=0 , 


Proof. We first check the absolute and local uniform convergence of the series in 
the claimed relation for fixed z €¢ H and s € C with Re(s) > 1. Using the estimate 


lcz+d|>C|ci+dl, 


where C = C(z) is a positive constant depending on z but which is independent of 
(c,d) € R’, we obtain the bound 


3 ou aA peo ag ye 3 Im(p2Reo) +k 


k=0 . y€Poo\l 


= = (2x|m|)* yReto) . y* 
a ! 2Re(s) 2k 
= k! sete lcz+d| lcez+d| 


ay ke x yew 1 
ae \cz + d |2Re(s) (c2 + d)k 


Z ee Cane 


< exp(22|m|yC~’) - a Re(s)). 
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This proves that the series in question converges absolutely and locally uniformly 
for s € C with Re(s) > 1. 

Now the claimed relation can easily be derived by changing the order of 
summation; namely, we compute 


yD Pn (z,8 +k) 


= 3 Im(yz)* exp(—2z|m|Im(yz))e(mRe(yz)) > Calm)" 


yeToo\I k=0 


Im(yz) 


= > Im(yz)* exp(—2z|m|Im(yz))e(mRe(yz)) exp(22|m|Im(yz)) 
v€Too\P 


Vin (Z; S). 


II 


This completes the proof of the lemma. Oo 


Proposition 3.7. For z ¢ H andm # 0, the Poincaré series V»(z,s) admits a 
meromorphic continuation to the whole s-plane with simple poles at s = s; —2N 
and s = —s; —-2N +1(N €N) with residues 


gla re eg —-N— 1/2) 
(QN)IF(EN + 1/2)|m]72"=17 


TeSs=s;—2N Vin (Z, 5) = 


Y= Bel) Wel2) (15) 


Se=Sj 
and 


22N- 1 NT As; ee 1/2) 
QQN)!D(—N + 1/2)|m|=7—2N F172 


Y= Belm) We), 


Se=Sj 


TESs=—s; —2N+1 VinZ, s) = 
(16) 


respectively. 


Proof. We start by proving that the Poincaré series V,,,(z, 5) has a meromorphic 
continuation to the half-plane 


Hy = {s € C|Re(s) > —N} 


for any N € N. By Lemma 3.6, we can write 


00 k 
Vin 2,8) = y oe Presto D> CH peste. AD 
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We show that the series 
0° k 
2n\|m 
>. Ce” Paless + &) 
k=N+1 , 


is a holomorphic function on the half-plane 1, . For this we estimate as in the proof 
of Lemma 3.6, assuming s € C with Re(s) > —N, 


co k 
2 
Ca Pn (ess +k) 
k=N+1 : 
co k—N-1 
(2x|m|)N*! — (2x|ml) 
= : Pr(zi(s +N +1)+(k-N—1)) 
eal eee! EG =N =D) 
oo k 
(2x |m|) 
< (2x|m|)**" \> Cri D Pn(eas +N +148) 
k=0 


< (2|m|)**! - exp(22|m|yC~?) - Epar(z, Re(s) + N + 1). 


This proves that the series in question converges absolutely and locally uniformly 
for s € C with Re(s) > —N, and hence the holomorphicity statement. 

Since the finite sum peg (22|m|)* /k! Pm(z, 8 + k) is a meromorphic function 
on the whole s-plane by Proposition 3.4, we conclude that V,,,(z, 5) has a meromor- 
phic continuation to the half-plane 14,. Since N was chosen arbitrarily, this proves 
the meromorphic continuation of V,,,(z, 5) to the whole s-plane. 

In order to determine the poles of V,,,(z, 8), we calculate its poles in the strip 


Sy = {seC| —N <Re(s)<-N+1} 


for any N € N. By considering V,,(z, 5) with its decomposition (17) in the strip 
S'y, we see that the poles come from the finite sum Fy (z, 5) := ype (2|m|)*/k! 
P»(z,5 + k). By Proposition 3.4, Fy (z, 5) has poles in the strip Si, at s = s; — N 
and s = —s; — N + 1. The explicit formula for the residues of P,,(z, 5) given in 
Proposition 3.4 now leads to the following residue of Fy (z, 5) ats = s; —N: 


N 


2x|m|)* 
Tess—s,—n Fy (z, 5) = > CTD e821) 0-4) Pal s) 
k=0 ; 
N s/4 bls, +(N—k)4 
=)" (2a|m))* (18 Far POD tig et OP 2s, — (N =k) —1) 
=F (W = EIT; = WV — Bmp OOP 


x YS) peln) Wel) 


SE = y 
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ay ei ko 28; +2N k+l SstN+IDQQ5,; -N+k-1 
ae k!(N —k)IT(s; —N +&)|m|si-¥-12 


1 Bom) We 


Se=Sj 


G1) lg P= 1/2) 


~ HIRCN24 1/aypmprrae DL, Pel ve) 


Se=Sj 


This shows that the residue in question vanishes if N is odd, and that the residue of 
Vin(z, 8) at s = s; — 2N is given by (15). Analogously, it is shown that the residue 
of Fy (z, s) ats = —s; — N + 1 is zero if N is odd, and that the residue of V,, (z, s}) 
at s = —s; —2N + 1 is given by (16). Oo 


4 Elliptic Eisenstein series 


Definition 4.1. For z ¢ H with z # yi for any y € T, ands € C, the elliptic 
Eisenstein series E. is defined by 


Een (Z, 8) = y sinh(o(yz)) °. 


yeri\P 


Lemma 4.2. (i) For z © H with z 4 yi for any y € I, the elliptic Eisenstein 
series Ea (z, 8) converges absolutely and locally uniformly for s € C with 
Re(s) > 1, and hence defines a holomorphic function. 

(ii) The elliptic Eisenstein series E.(z, 8) is invariant under the action of T, i.e., 
we have Eu (yz, 8) = Eun(Z, 8) foranyy €T. 

(iii) For fixed s € C with Re(s) > 1, the elliptic Eisenstein series E.y(z, 5) 
converges absolutely and uniformly for z in compacta K C HH not containing 
any translate yi ofi byy €T. 


Proof. (i) To ease notation, we write s = o +it € C; we assume thato = 
Re(s) > 1. We fix z € H such that z yi for any y € I. Since I acts properly 
discontinuously on H and z 4 yi for any y € I’, the minimum 


Ri(z) = min dpyp (i, YZ) 


exists and is strictly positive. Introducing the quantity 


1- exp(—2Rj(z)) < 


C\(z) = 5) 


0, 


we derive the inequality 


1 — exp(—2o(yz)) 
2 


= Ci(z) 


Elliptic Eisenstein series for PSL2(Z). 411 


for all y € I’. From this we obtain the estimate 


1 — exp(—2e(yz)) 


5 > Ci(z) - exp(o(yz)), 


sinh(9(yz)) = exp(o(y2)) - 


again for all y € I’. From this we derive the estimate 


>. sinned) *] = > sink(owa)” 


yeri\P yeri\C 


<Ci()"- SY) exp(-ca(y2)). 


yeti \P 


In order to complete the proof of (i), we are left to show the local uniform 
convergence of the series 


Y= exp(—ce(y2)) 


yeni \P 


foro > 1. To do this, we introduce for r € Rso the quantities 
G(r) := {y ET: \T |e(va <r}, Nr) := G0). 


We note that the number V(r) is finite, since I acts properly discontinously on H 
and z # yi for any y € T’; in particular, we have N(r) = 0 forO0 <r < Ri(z). 

For fixed r € R,o, we are next going to estimate the number N(r). Let G,(7) 
denote the open hyperbolic disk of radius r centered at i containing the finitely many 
translates yz of z for y € G(r). Then there exists a constant ¢(z) > 0, depending on 
z, such that the open hyperbolic disks 6,,,)(yz) of radius e(z) centered at yz do not 
intersect for all y € G(r) and are contained in B,(7). Consequently, we obtain 


N(r) + Volnyp (Bei (72)) < VOlnyp(B- (i) (vy € G7). 


This yields the estimate 


Na) < 4x sinh? (r /2) __cosh(r)— 1 __ exp(r) + exp(—r) — 2 
= ‘Ao sinh2(e(/2)  2sink(e(Q/2) 4 sink? (e(2)/2) 
1 + exp(—2r) 
< exp(r)- 4sinh?(e(/2) < Cy(z) - exp(r) (18) 


with a suitable constant C2(z) > 0 depending on z. 
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For fixed R € Ryo, the monotone increasing step function N : [0,R] — 
N induces a Stieltjes measure dN(r) on the interval [0, R]. Since the function 
exp(—or) : [0, R] —> Rso is continuous and the function N(r) is of bounded 
variation, the function exp(—or) is Riemann-Stieltjes integrable with respect to 
N(r) on the interval [0, R]. Furthermore, since V(r) and exp(—or) are bounded on 
[0, R], the theorem of partial integration can be applied to give 


R 
> exp(—00(y2)) =i exp(—or)dN(r) 
yer)\P ° 
yeG(R) 


R R 
= I) exp(—o7)|. - N(r)d (exp(—or)) 
R R 
= [yo exp(-or) | es / oN(r)exp(—or)dr. (19) 
0 
Using (18), the first summand of (19) can be bounded as 
[yo exp(-or) | = N(R) exp(—oR) < C2(z) exp((1 = o)R). 


On the other hand, again using (18), the integral in (19) can be bounded as 


R R 
i: oN(r) exp(—or)dr < oc) | exp((1 — o)r)dr 
0 0 
C 
= —— 28) (exp((1 —o)R)- 1). 


Summing up, we arrive at 


d= exp(-se(y2)) = Jim D7) exp(—sa(v2) 
yeri\r yeri\P 
yeG(R) 


< in]oo exp((1 — 0) R) 
C 
+ 720 (exp( —o)R)- :)| 


_ oC2(z) 


ol’ 


keeping in mind that o > 1. The absolute and local uniform convergence of the 
elliptic Eisenstein series €.(z, 5) now follows for s € C with Re(s) > 1. 
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(11) From definition 4.1 we immediately deduce for s € C with Re(s) > 1, 


Ea(yz, 5) = Een (z, 8) 


for all y € T’, provided that z 4 yi for any y ET. 

(iii) Finally, let K C H be a compact subset not containing any translate yi 
of i by y € I. Then the constants C(z) and C2(z) constructed in the first part 
of the proof can be chosen uniformly for all z € K. For fixed s € C with 
Re(s) > 1, the series €.(z, 5) therefore converges absolutely and uniformly on 
K CH. Oo 


Lemma 4.3. Forz = x +iy € Hwithz 4 yi forany y € T, ands € C with 
Re(s) > 1, the elliptic Eisenstein series E.y(z, 8) is twice continuously differentiable 
with respect to X, y. 


Proof. In order to prove the claim, we have to show in a first step that the series of 
partial derivatives 


r) _ a, = 
> ay Sinh(o(y2)) fs 2 5, Sinh(o(y2)) (20) 


yer \C yeti \P 


converge absolutely and uniformly on compacta K C H not containing any translate 
yi of i by y € T provided thato = Re(s) > 1. To do this, we introduce for 
functions f € C!(H) the notation 


Vayp f(z) = (Hey + ()’). 


Letting g(x, y) := (a? +¢7)(x? + y2) + 2(ab + cd)x + (b? +d’), we have by (2), 


EN Ea 


sinh(o(yz)) = cosh’ (o(yz)) -l= ( Dy 


from which we derive 


a. _ cosh(o(yz)) 9 g(x,y) 
an sinh(@(yz)) sinh(o(yz)) "Ox 2y 


(a? + c*)x + (ab +cd) 
; : 


| 


coth(o(yz)) 


and 


p(x, ”)), 


0 
— sinh(o(yz)) = coth(e(yz)) - (Ge +¢’)- dy? 


dy 
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A straightforward computation yields 


Vayp sinh(o(yz)) = cosh? (o(yz)), (21) 


from which we deduce 


a 
Fe sinh(o(y2))| < y~' \VVayp sinh(o(yz)) = y~' cosh(o(yz)). 


= sinh( (o(v2))| < y~' yVyp sinh(o(yz)) = y~! cosh(o(y2)). 


By the choice of the compact set K, there is a positive constant Cx such that the 
inequality cosh (o(yz)) < Cx-sinh (0(y2)) holds for all z €¢ K. Therefore, we obtain 
forze K, 


Pe sinh( (o(v2)) "| < Cx-|s|-y7'- sinh(e(yz)) ° 


< Cx -|s|+y7! + sinh(o(yz)) ° 


Es sinh(o(yz)) ” 


The absolute and locally uniform convergence for the series (20) now follows from 
Lemma 4.2 provided that o > 1. 

To ease notation, we put for the second step x; := x and x2 := y. We will then 
show that for j,k = 1, 2, the series 


2 


0 
Ox; OX; 


sinh(o(yz)) * (22) 
yeti \P 


converges absolutely and uniformly on compacta K C H not containing any 
translate yi of i by y € IT provided that o = Re(s) > 1. Setting f(z) 
sinh(@(yz)), we estimate for z € K, 


a? = 
fee sinh(Q(y2)) . | 
j 
2 
» once 9-09 BOE ca peroe PA 
< |s? +5|- f(Q7et?. 8 na}, | f(Qr ery. Pf 
ax) OxK 
Bole? + slag? FO + 51 fQ7e*) | SLO 


xj Axx Ox, | 
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We are left to estimate the term |0? f(z)/0x ; 0x, |. For this, we use the fact that for 
real functions g(z) = g(x1,X2) defined on H with continuous first- and second- 
order partial derivatives, the inequality 


0° g(z) 
OX j OXK 


ce V Vayp& (z) * Wma | Anypg(z)| 


V Vayp&@) 


holds for all z = x; + ix € H provided that Viypg(z) A 0 (see [5]). Using (21), 
we obtain 


Vigk sinh(o(yz)) = Vayp cosh*(o(yz)) = 4 cosh’ (Q(yz)) sinh (o(yz)), 


which yields 


Viv sinh(o(yz)) 


———————<$—<——_——s i 2 
Tyo imho) 4 sinh“ (o(yz)). 


This, together with the relation | Anyp sinh(o(yz))| = 2 sinh(o(yz)) +sinh(o(yz)) =, 
leads to 


al Vinod 
vy Vnyp f(z) + VWmF@ + | Abyp f(2)| 


= cosh(e(yz)) + 4sinh(o(yz)) + sinh(o(y2)) 


Therefore, by the choice of the compact set K, there is a positive constant Cy such 
that the inequality 


a f(z) 


Ox; OX; 


| = Cee i . sinh(o(yz)) 


holds for z € K. Again, the absolute and locally uniform convergence for the series 
(22) now follows from Lemma 4.2 provided that 0 > 1. This concludes the proof 
of the lemma. Oo 


Lemma 4.4. For z € H withz 4 yi for any y € T, ands € C with Re(s) > 1, the 
elliptic Eisenstein series E(z, 8) satisfies the differential equation 


(Anyp — s(1 — 8)) Een(z, 8) = —87Een(z, 5 + 2). 


416 J. Kramer and A.-M. von Pippich 
Proof. Since the differential operator 


92 1 r) 1 0? 
de? tanh(@) 99 _~—ssinh?(g) 00? 


Anyp = — 


is invariant under the action of I’, it suffices by Lemma 4.3 to prove the equality 
(Anyp —s—- s)) sinh(g)~* = —s? sinh(g)~°t. 
This follows immediately from the subsequent calculation 


Anyp sinh(@)* = s(—s — 1) sinh(g)~°*” cosh?(@) + s sinh(@)~* 
+ s sinh(@)°*? cosh? (Q) 
= (—s* — 5 +5) sinh(o)~“*”) (1 + sinh?(o))+s sinh(@)~* 
= —s? sinh(g) ©t” + s(1 —s) sinh(@). oO 


5 Fourier expansion of the elliptic Eisenstein series 


Lemma 5.1. For z € H with Im(z) 4 Im(y7!i) for any y € T, ands € C with 
Re(s) > 1, the elliptic Eisenstein series E.y(z, 8) admits the Fourier expansion 


Ea(z.s) = >) dm(y, s)e(mx), 


meZ 


where 


b+cd 
Am(y, 8) = » e(m—>) 


ore) ate (a2 + c?)y? +1 2, —s/2 
x =I ‘a —mt)dt. 
[( = 2y z 2(a? +c?)y ) ) com) 


Proof. Since Eey(z + 1,5) = Ecn(z,s), the series Ej (z, 5) admits the Fourier 
expansion 


Eu(Z, S) = > am (y, s)e(mx), 


meZ 
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where 
1 


1 
an(v.s) = f Ean (z, S)e(—mx)dx = > / sinh(o(yz)) e(—mx)dx 


ya(2 B)en\r 


II 


1 
» > i. sinh(o(y(z + 1))) °e(—mx)dx 

_(ab , néeZ@ 0 

yas Servo 


= » a sinh(o(yz)) °e(—mx)dx. 


y=(2 4 en\T/Teo 


Now, writing sinh’ (e(yz)) = —1 + cosh(o(yz)), using (2), and substituting ¢ : 


x + ab+cd 


ater WE obtain 


cosh(o(yz)) 


L feo as Di ge A050 1 
5 (@ +07)? + (a Oy ts ) 
a ay Cn (a? +c?yy? are | 

2y 2(a? + c?)y 


From this the claimed formula for a,,(y, 5) follows immediately. 


Proposition 5.2. For z © H with Im(z) > 1, ands € C with Re(s) > 1, we have 


y2* . Vo(s + 2k), 


BEE), ga SE 


BOS TG) + Dek! 


k=0 


where 


Proof. Letting m = 0, we derive from Lemma 5.1 


do(y, S) — 2S boy (y. 5), 


v=(4 5 )er\/Too 
where 
ee ate (a? + c?)?y? + 1\2\-8/2 
bo. (v.58) :=2 (-1 ( 1? ) ) dt 
oy (y,5) i + 2y a Wae+e)y 


ize # az (a? + c?)?y? + a 
0 2y 2(a? + c?)y vt 


oO 
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Substituting 
ew (EHO )Y + IP (: (@ +e*)y + yy 
> (a? +2)? (a2 +c?) 
we obtain 
2 y5 (a? 4 c7)s7! [ 3% 7 
b sy= e240 =r) 
I= Gray f” OO? 


4(a* + c*)y =H/2 
x (1 _ (+7 +12 -r) r. 


Now using the integral representation (3) of Gauss’s hypergeometric function 
F(a’, b’;c'; w) with 


a b':=s c s, and AGE he) 
=, =S—-, ay wis : 
2 2 ((a2 + c?)y + 1)? 


which is justified since Re(c’) > Re(b’) = Re(s) — 1/2 > 0, we obtain 
2 y(a2 +02)! Ja T(s— 1/2) 


(@+ey+ De" TC) 


2 2 
«FS. 538% 4(a- + c*)y ). 
2 2 a? hie? ye Ly? 


boy, s) = 


Since T = PSL2(Z) and y > 1, we have 


A(a? +.c?)y 
(a tc?) y eA 


and so the hypergeometric function in question can be represented as a series, which 
shows that 


F(3 1 A(a* + c”)y ) ( 1s A(a? + c?)y ) 
S215, = = StS} ; 

2 2° ((@ +7)y + 1)? 2° 2°" (a? +?)y + 1)? 

Now the hypergeometric function under consideration is of the form F(b’,a’; 

2a’; w), which allows us to apply the following formula (see [1], formula 15.3.17): 
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F(b’, a’; 2a’; w) 


2 
/ , 1 1 f/l-vl1- 

= 9? 14/1 a—w) F | bb =a + us 

2 2\14+VJ1-—w 
(23) 

Again, since y > 1, we have 
i A(a2+c?)y  — (a® +c?)y-1 
(@reaytip  @reyy er 


which leads to 


ls A(a? +. c*)y 
Fl(s—=,=:s; 
2°2’ ’ ((a2+c?)y +1)? 


_ oi (2@+ey VP" (, dss dd 
7 (a2+¢?)y +1 ooo (a2 + ¢?)?y? 


ls is oe. 1 
2 2) .\-2s+1 (¢,2 2 2s-1 ‘ 
= 1 F ; . 
(a + c°)y) (a> +e )y +1) (s- 22’ 77 2 ann) 
(24) 
Adding up, we obtain 

2° ./n V(s— 1/2 Irs lss 1 1 

boy (y,8) = — ss OSG sms aa) 
T(s) (a* + c*)s 222 2 (a*+c?’)*y 


Introducing the notation 


2° ./aT(s — 1/2) 


g(s) = T(s) , 
we arrive at 
ag(y, 5) 
= g(s):y'* pz as F(s- 3.535 + ca) 


2 
ll 
aS 

a 


o b Veri\I/Too 
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sph ed Sea ) 
= g(s)-y » @roy La Dy kl (a2 + c?)? y? 


v=(4 5 )eriA\P/Too i 


es Wa DE Deo 
g(s)-y » + Dyk y o(s ) 


This completes the proof of the proposition. oO 


Remark 5.3. The statement of Proposition 5.2 can easily be generalized to the case 
z € H with Im(z) ¥ Im(y7!/) for any y € T as follows: When applying formula 
(23) in the case y < (a? + c*)~!, formula (24) becomes 


ls 4(a*+0c?)y 
Fs 33" a@eay +P) 


= ((a* +¢7)y +1)? LF(e=5.5 Sst (a? + ¢*)’y *). 


Therefore, we arrive at 


ao(y, s) 


=gs):y'* YS @ 4 Yee =e eo -((@ +¢?)y) 
y=(2# Jen Tits 


y>(a2+c?)—! 


+ g(s)-y° YO @+e’) Dy, z ares so -(@ +e?)y)* 


y=(2 a )ETAL/Too 
oe +c?)7! 


Proposition 5.4. For z € H with Im(z) > 1, s € C with Re(s) > 1, andm # 0, 
we have 


= Onne: 
an(v.s) = 29" ST Ber Eom(8i kas ka) + Vols + 2k + 2k), 
k,=0k2=0 


where 


Im(¥.5; 1, ka) = / (y + i) (y — it) e (mt) dt 
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and 


Vins) =D) Im(yi)°e(mRe(y~"i)). 


yeTi\T/Too 
Proof. Form # 0, we derive from Lemma 5.1 


Am(y, 5) = y om Vom 8) 


az + 
v=(4 2 )er\P/Too 


where 
loo) az +c? (a? + c?)*y? +1 2 —s/2 
bn (¥, 8) 1= -1 t? —mt) dt. 
5) [( +( ag oS oa ey )) e(—mt) 
(25) 
We write 


—1+ e ee eg Oe TF 


2y 2(a? + c?)y 
~ ae (? 7 (» is a : 2)) G bs (»  @ : =) ) 
(a? + c?)? 


near (i+ y+ aa) (“+ y ae =a) (t+ ¥- —. 


: 1 
«(Herat 


_@+e?yP hes. at f 
= “Gyr 0 + '0-0'(\- GEG aap) 
1 
a (1 — (@+eP2(y— ae): cau 


Since y > 1, we have the estimate 


1 1 
= <1, 
eee ESTO) ee 


and hence we can write 


I ae (sk ue 
(1 —(@+EPG + a) = ogee + BF 4 k(a? ki(a2 + c2)% (ytin™. 
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Therefore, we obtain 


_ Qy)y (kr > Qe 
bnny (YY, 8) = (a (a2 + 02) s 3 ky!- kp! - (a2 + c2)2ki +k2) 


ky =0k)= 0 
[oe 
x i (y + it) (y — it) *- 7” e(—mb) dt, (27) 
—oo 


from which the statement follows. oO 


Remark 5.5. The statement of Proposition 5.4 can be generalized to the case z € 
H with Im(z) 4 Im(y~!i) for any y € I. In this case the Fourier coefficient in 
question becomes 


ab + cd 
amo= em *) 906.5) 

v=(4 5 )eri\P/Too 

y>(a2+c?)—! 

ab + cd p(2) 

oF 2; e(m—>) bin.y VS)» 

y=(4 5 eM \r/Too 

y<(a?+c?)—! 


where 


(a Qe 
©) ye 5 Da On 1 2 
De ws y= aaaS s ile ko ! Lamjtarer)((@” +e Vy, S; ky,ke), 
ki =0 ko =0 
(28) 


Ctr Gli 1 
bry, = a 2 ere ky! ky! ‘Lom eas ©)y 83k, ka). 
(29) 


Here be (y, 5) is obtained as in the proof of Proposition 5.4 (after a suitable 


substitution in (27)), whereas ae y, 5) is obtained by rewriting (26) as 


2 2 2 Ve aa 
-1+(5 ef (a“ + c*)*y~ + ) 
2y 2(a? + c*)y 


_@+ 7 1 rn =) 1 2 
7 4 (a2 +c?)y ys \@+e%y y 


x (: = e Hy ap *)) (: = or > )* 
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which, after using the expansion 


—s/2 00 : 
1 it\ 2 (2)k 1 it —2k 
(-(tas+5 = 2 kl (eaay 9) 


yields the claimed formula (again after a suitable substitution in the corresponding 
integral). 


Remark 5.6. The series V,,(s) (m € Z) of Propositions 5.2 and 5.4 can be rewritten 
as follows: Consider the anti-isomorphism ¢ : T > T given by y & y7!. Since 
b(Too) = Moo and (T;) = Tj, we have @(Ta\P/T;) = Ti \/ Too. Therefore, 
we obtain 


Vin(s) = ye Im(y7!i)*e(mRe(y~'i)) a > Im(yi)*e(mRe(yi)) 
yeTi\P/Too y€Doo\P/Ti 
1 
= —Vali, 
5Ymn(iss) 


with the Poincaré series (14) evaluated at z = 7. Note that the series Vo(s) multiplied 
by €(2s) equals the Dedekind zeta function associated to the field of Q(/). 


6 Meromorphic continuation of the elliptic Eisenstein series 


Lemma 6.1. The series 


1 
Vo(s) = > @tey 


converges absolutely and locally uniformly for s € C with Re(s) > 1, and hence 
defines a holomorphic function. It has a meromorphic continuation to the whole s- 
plane with a simple pole at s = 1 and poles at s = p/2, where p is a non-trivial 
zero of €(s). Furthermore, we have Vo(1/2) = 0. 


Proof. Since Vo(s) = Epar(i,s)/2, the claimed assertions immediately follow 
from the known properties of the parabolic Eisenstein series €,-(z, 5) recalled in 
Section 2.4. In particular, the vanishing of Vo(s) at s = 1/2 follows from the 
functional equation (6) by observing that (1/2) = —1. Oo 


Lemma 6.2. For z € H with Im(z) > 1, and N €N, the series 


ea —1l),.(% 
~~ s 2k Ge -y 7k . Vo(s + 2k) 
oa TG + 5+k)-k! 
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converges absolutely and locally uniformly for s € C with Re(s) > —2N — 1, and 
hence defines a holomorphic function. 


Proof. Fix N € N, and let s € C with Re(s) > —2N — 1. Then, for k € N, we 
define the functions 


(s — 5k 7 (Sk yok 
rg+3+k)-k! 


Sis 8) = 8k, 8)- Vols + 2k), where gx(y, 8) = 


Ifk > N +1, we have Re(s + 2k) > Re(s) + 2N +2 > 1, whence the functions 
Vo(s + 2k) are holomorphic. Since the functions g;(y,s) are also holomorphic in 
the range under consideration, the functions f(y, s) are holomorphic for s € C 
with Re(s) > —2N — 1 as longask > N + 1. We now estimate 


Y> | fev. 5)| < Vo(Re(s) +2N) SY > |geG,5)]. 
k=N+1 k=N+4+1 


Since the ratio of successive terms in the latter series has limit 


; —~liky(S+k 
™ [ge+1 5) _ km (f= 4 e)(5 a ). 1)_1 2: 


koo |ge(y.s)] Koo (S44 4K)04k) y?2] 


we derive from d’ Alembert’s criterion that the series 77°41 fe (J, 5) converges 
absolutely and locally uniformly for s € C with Re(s) > —2N — 1, which proves 
the claim. oO 


Proposition 6.3. For z € H with Im(z) > 1, the function ao(y,s) has a 
meromorphic continuation to the whole s-plane with possible poles ats = 1—2N, 
s = p/2—2N, 5s = 1/2—2N, ands = —-1/2—2N (N €N), where p is a 
non-trivial zero of €(S). 


Proof. We start by proving that the function ao(y, s) has a meromorphic continua- 
tion to the half-plane 


Hy := {s € C|Re(s) > —2N — 1} 


for any N €N. By Proposition 5.2 and the duplication formula for the [’-function, 
we can write, using the notation from the proof of Lemma 6.2, 


2 Ju Vts—*)r(é +4 ut = 
woe BAD (Shown oe i) 


T'(s) k=0 k=N+1 


2nT(s—i N 00 
- Poe (S noo ~~ fon) (30) 


k=N+1 
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Since Re(s) > —2N — 1 by assumption, Lemma 6.2 proves that the series 
eat +1 f(y, 8) is a holomorphic function on the half-plane Hy. Since the finite 
sum oe Jx(y, 5) is a meromorphic function on the whole s-plane by Lemma 
6.1, we conclude that ao(y,s) has a meromorphic continuation to the half-plane 
Hy. Since N was chosen arbitrarily, this proves the meromorphic continuation of 
do(y, 8) to the whole s-plane. 

In order to determine the poles of ao(y, 5), we calculate its poles in the strip 


Sy :={seC| —2N—-—1 < Re(s) <-2N 4+ 1} 


for any N € N. By considering ao(y, 5) with its decomposition (30) in the strip 
Sy, we see that the poles come from the finite sum oe Jk (y, 5), which has poles 
in the strip Sy arising from the function fy (y,s5), more precisely from the factor 
Vo(s + 2N) ats = 1—2N ands = p/2 —2N, where p is a non-trivial zero of 
€(s), and from the I'-factor T'(s — 1/2) at s = 1/2-—2N ands = —1/2—2N. 
Therefore, the possible poles of ao(y, 5) in the strip Sy are located ats = 1—2N, 
8 = p/2—2N,s =1/2—2N, ands = —1/2—2N, as claimed. o 


Remark 6.4. Using Remark 5.3, one can establish the meromorphic continuation 
of ao(y, 8) to the whole s-plane in the more general case Im(z) # Im(y~!i) for 
any y € I’, using the same techniques as in Lemma 6.2 and Proposition 6.3 applied 
accordingly to the modified situation. The poles of ao(y, 5) turn out to be same as 
in the case Im(z) > 1. 


Lemma 6.5. For y > 1,5 € C,m £0, andky, kz EN, let Im(y, 8;ky, kz) denote 
the integral 


Ce ee / (y + iy" (y — i)? 0 dt 


Then the following assertions hold: 


(i) The integral In(y,5;k\,k2) converges absolutely and locally uniformly for 
s € Cwith Re(s) > 1/2 —k, —ky, and hence defines a holomorphic function. 
(ii) The integral I,(y,5;k,,k2) admits a holomorphic continuation to the whole 
S-plane. 
(iii) Let QC C be acompact subset and let d € N be such that Q C {s € C |Re(s) > 
1/2 —k, — ky —d/2}. Then, we have for all s € Q the bound 


d 
(ki + kp) Ly WRe(s)+ki +ky+d/2)+1 


[Im (9.83 k1,k2)| « im|4 


where the implied constant depends on Q and d, but is independent of m and 
ky, ko. 


426 J. Kramer and A.-M. von Pippich 


Proof. (i) For s € C with Re(s) > 1/2 — ky — ko, we have the estimate 


foe) 
Eee as ki ,k)| < / l(y ay iG) e(mt)| dt 
—0oo 
foe) 
-| (i ePy rae dt 
—0o 


oo t2 \ —Re(s)—ki—k2 dt 
= —2(Re(s) +k, ka) 1 
x ») y 
, © (Re(s) — 1/2 + ky + ky) 
T'(Re(s) + ky + ka) 


2(Re(s)+k1+k2) 


=) 


< ey 2Re) tea) H (31) 


For all s € Q, where Q C {s € C| Re(s) > 1/2 —k, —k2} is a compact subset, we 
therefore obtain the bound 


[Zn(y, 8; kt, ka)| <7, 


which shows that the integral J,,(y,s8;k1,k2) converges absolutely and locally 
uniformly for s € C with Re(s) > 1/2 — ky — ko. 
(ii) Let Re(s) > 1/2 — k, — ko. Integration by parts yields 


CO 
Im(y,8:k1,k2) = lo + ity 7 (y — ins om 


2rim |;— 009 


+ ime 
2am 


1 2 : : 
Ser / (s + 2ko)(y + it) 971 (y = it) 8-22! emt) drt. 
—Co 


CO 
/ (s + 2k1)(y + it) 8-241“ (y — it)? emi) dt 
—oo 


In absolute values, the boundary term equals 


e(mt) 


2nim 


(y? Bi a a a 
~ 21 |m| 


lo +4 ne (y _ uy 


Since Re(s) > 1/2—k, — kz and y? + ¢? > 1 fort € (—oo, 00), we have 
(y? me oe i a < (y? +4 Py, 


from which we conclude that the boundary term vanishes. Therefore, we obtain the 
recurrence formula 
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(42m) 7, 1 
— In(y. 83k + 5k) 


(s + 2k2) 1 
= In (y. ikke 4 sh (32) 


Ind, 53k, k2) _ 


By part (i), both terms on the right-hand side are holomorphic for s € C with 
Re(s) > —k, — ko. In this way we obtain the holomorphic continuation of 
In(y, 8; k1, kz) to the half-plane {s € C| Re(s) > —k, — kp}. 

Let d € N. Applying relation (32) d times, we arrive at a formula of the type 


d 
1 
In(y, 83k1,k2) = (Qxm)t > Pa,j (s3k1,k2) 
j=0 


. f=? 
x In(y,s:ki + £,ke + =), (33) 
2 2 
where Py; (8; k,, kz) is a polynomial in s and k,,k2 of degree d. In fact, one can 
prove by induction on d that 


Pa j(8;k1,k2) = (-1)47 - e “ > (8 + 2k1); «(8 + 2k2)a-; 
O<j<d). 
Now all the terms in (33) are holomorphic for s € C with 
Re(s) > 1/2 — (ky + j/2) — (ko + (d — j)/2) = 1/2 -—k, — kp — d/2. 


Therefore formula (33) yields the holomorphic continuation of [,,(y,5;k,,k2) to 
the half-plane {s € C|Re(s) > 1/2 —k, — ky — d/2}. Since d € N was chosen 
arbitrarily, this proves the holomorphic continuation of [,,(y, 5; k , kz) to the whole 
s-plane. 

(iii) Let & C C be a compact subset and let d € N be such that Q C {s € 
C|Re(s) > 1/2—k, —k, —d/2}. Fors € Q, the function J, (y, s;k1, k2) is given 
by formula (33). Since Re(s) > 1/2 —k, — ky —d/2 = 1/2— (ki + j/2) — (ko + 
(d — j)/2), the bound (31) provides the estimate 


j d a 1 i 4 
ta(y.sths + Eka + 25 2)) ec yremmnttiaensares 


where the implied constant is universal. Furthermore, letting s € Q, we have the 
bound 


|Paj(sski,ko)| Kk) RSS K (ki th)’ (<j <d), 
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where the implied constant depends on Q and d, but is independent of m and k,, 
k>. Altogether, as long as s € Q, we have the bound 


—2(Re(s)+ki+k2+d/2)+1 


(2|m|)4 


y 


[Zm(y, 83 k1,k2)| K (d + 1+ (ky + ka)? 


d 
(ki + ka) Ly 2Rels) thi tke-+d/2)+1 
|m|4 


where the implied constant depends on Q and d, but is independent of m and 
ki, ko. Oo 


Lemma 6.6. Form # 0, the following assertions hold: 


(i) The function V,,(s) admits a meromorphic continuation to the whole s-plane 
with possible simple poles at s = s; —2N ands = —s; —2N + 1 
(N €N). 

(ii) Let N € Nand C {s € C|Re(s) > —2N — 1} a compact subset not 
containing any pole of Vin(s). Then, for all s € Q, we have the bound 


RA) al ce 


where the implied constant depends on Q and N, but is independent of m. 
(iii) Let N € Nand 5 a pole of Vin(s) with Re(s) = —2N + 1/2. Then, the residue 
of Vin(s) at S is bounded by 


espe Ven (s)| <« |m|?%, 


where the implied constant depends on § and N, but is independent of m. 


Proof. (i) Since we have Vin(s) = Vn(i, 5)/2 by Remark 5.5, the claim follows 
immediately from Proposition 3.7. 

(ii) We will prove the claim more generally for the Poincaré series V,,,(z, s) for 
any z € H. Fors € &, we then consider the decomposition 


k 
—— P,, ZS5+ k). (34) 


From the proof of Proposition 3.7 we recall that the series on the right-hand side 
converges absolutely for s € (2. Hence, we can rearrange the summation and find 
fors € Q, 
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ye Palos + &) 
k=2N+2 : 


oO (27 |m|)2% +k+2 


— QN +k +2)! 


Pn(z,s +2N +k +2) 


3 (2|m|)* > Im(y)"t28 +42 


= (22|m|)7% +? 
! 
cam (2N+k+42)! nae 


x exp(—2z|m|Im(yz))e(mRe(yz)) 


= 2x|mjpen?? 


> Im(yz)*t24 +? 


v€Poo\P 


oe) 


x exp(—2z|m|Im(yz))e(mRe(yz)) ~ 
k=0 


(2z|m|Im(yz))* 
(QN +k +2)! 


< (Qa |m|)?%*?_ S$ Im(yz) R42"? exp(—2r|m|Im(yz)) 


y€Too\P 
oO k 
2 I 
<5 ximlln(y2) 
— k) 


= (2n|m|)?" +? . Epar(z, Re(s) + 2N +2) « |m?P?%*?, 


where the implied constant depends on z, Q, and N, but is independent of m. 

In order to estimate the finite sum in the decomposition (34), we multiply the 
bounds (10) and (13) by the factor 2775+! 78+! P(s)7!|m|+!/2, and derive from 
the spectral expansion (9) of P,,(z, 5) for all s € Q the bound 


| Pn(z, 5 + k)| & jen[ Rt?  jmPN 432k = 0,...,2N + 1), 


where the implied constant depends on z and Q, but is independent of m. Hence, for 
all s € Q, we obtain 


2N+1 
[Vin(z,8)| « » ln « |m|2N-k+3/2 am 2NF2 < | PN, 
k=0 


where the implied constant depends on z, Q, and N, but is independent of m. This 
proves the second claim. 
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(iii) In order to prove the third claim, we recall formulas (15), resp. (16), together 
with the bound (see [10], p. 86, adapted to our situation) 


| pe(m)|” < |t;|exp(z|t;|) (LEN: se=s; = 1/2 + it;), 
where the implied constant is universal. Then we obtain 
[eS sagj on Veale) < lal EEN N? < ln/PR, 
resp. 
fres.——s)—an-+1Vin(Z,s)| <K m|REDHN—W2 < Imp, 


where the implied constants depend on z, s;, and N, but are independent of m. O 


Lemma 6.7. For z € H with Im(z) > 1, m 4 0, and N €N, the series 


Ga (De 
2° y* + y Ea ko ! = » Tin (y, 83 k1, k2)- Vin (s + 2k, + 2k») 
n=N+1kj+ko.=n 


converges absolutely and locally uniformly for s € C with Re(s) > —2N — 1, and 
hence defines a holomorphic function. 


Proof. Let 2 € {s € C|Re(s) > —2N — 1} be a compact subset. For s € Q and 
k,,k2 € N, we define the functions 


a: Qe 


Sink ko (Y; S)i= 2*y ky! ky! 


5 I_mQ, S; k1,k2) * V_n(s + 2k + 2k). 
Ifk; + ky > N +1, we have Re(s + 2k, + 2k2) > Re(s) + 2N +2 > 1, whence 
the functions V_,,(s + 2k, + 2k2) are holomorphic for s € Q. By Lemma 6.5 (ii), 
the functions [_,,(y,5;k,, kz) are holomorphic for s € C. Therefore, the functions 
Fin:ky ko (Y, 8) are holomorphic for s € Q, as long as ky +k. > N + 1. Now choose 
d €Nwithd > 2N +1; then we have Q C {s € C| Re(s) > 1/2—k, —kn—-d/2}, 
as long as kj + ky > N + 1. Using Lemma 6.5 (iii), we estimate for s € Q, 


DDE Fmt. kos 8)| 


n=N+1k,+k.=n 


« Vo(Re(s) + 2N + 2) 


Rey RAIMA SSI Bl tay! 
d 1. hel 2k ko) 
|m| n=N+1ki+k2=n Keyl ke! ee 
—Re(s)-d+1 © 


|m|4 


Z = lo) 
Re(s)—d+1 n@ (Is|)n 


ki!-(n—ky)! |m|4 


n@ x (baba (5 Dn—ky y 
2n 2n- yl 


n= N+1 n=N+1% 
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where the implied constants depend on Q, d, and N, but are independent of m and 
k,, ky. Since the ratio of successive terms in the latter series has limit 


(n+ 1)¢-(n+Is) 1 
n?-(n+1) y? 


noo 


we derive from d’Alembert’s criterion that the series in question converges abso- 
lutely and locally uniformly for s € C with Re(s) > —2N — 1, which proves the 
claim. 

For later purposes, we note for s € Q the bound 


< |m|~“, (35) 


> 2 Tins ko (Ys 8) 


n=N+1kj+ko=n 


where d € N withd > 2N +1, and where the implied constant depends on z, Q, d, 
and N, but is independent of m. oO 


Proposition 6.8. For z € H with Im(z) > 1, andm 0, the following assertions 
hold: 


(i) The function ay(y,s) admits a meromorphic continuation to the whole s- 
plane with possible simple poles ats = s; —2N ands = —s; —-2N +1 
(N EN). 

(ii) Let N € Nand C {s € C|Re(s) > —2N — 1} a compact subset not 
containing any pole of an(y, 8). Then, for all s € Q, we have the bound 


|am(y.s)| << |m|~“, 


where d € N withd > 2N + 1, and where the implied constant depends on z, 
Q, d, and N, but is independent of m. 

(iii) Let N € N and 5 a pole of am(y, 5) with Re(s) = —2N + 1/2. Then, the 
residue of Am(y, 8) at S is bounded by 


|res;=34m(y,5)| << |m|~4, 


where d € N with d > 2N + 3, and where the implied constant depends on z, 
8, d, and N, but is independent of m. 


Proof. (i) As before, we obtain the meromorphic continuation of a,,(y, 5) to the 
whole s-plane by constructing its meromorphic continuations to the half-planes 


Hy := {s € C|Re(s) > —2N — 1} 
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for any N € N. Applying Proposition 5.4 and using the notation from the proof of 
Lemma 6.7, we can write 


N ee) 
an s)= >) D> fain + Do DD fain @s) (G6) 


n=0ki+ko=n n=N+1kj+h=n 


Since Re(s) > —2N — 1 by assumption, Lemma 6.7 proves that the series 


yy DS hee) 


n=N+1ki+k.=n 


is a holomorphic function on the half-plane #1. Since the first double sum in (36) 
is a meromorphic function on the whole s-plane, we conclude that a,,(y,s) has a 
meromorphic continuation to the half-plane Hy. 

In order to determine the poles of a,,(y, 5), we calculate its poles in the strip 


= {seC| —2N—-1 < Re(s) <-2N +1} 


for any N € N. By considering a,,(y, 5) with its decomposition (36) in the strip Sy, 
we see that the poles come from the finite sum 


y 2 Im; sky kas s)= — 2*y" 3 Vim(s + 2n) = ata ae peace 


n=0 kj +k,=n n=0 


xX Tin (y, 83k1,n — ky), 


which has possible simple poles at s = s; —2N and s = —s; —2N + 1 in the strip 
Sy arising from the factors V_,,(s + 21) (n = 0,..., N). Therefore, the possible 
poles of a,(y, 5) in the strip Sy are located ats = s; -2N ands = —s; -2N+1 
(N EN). 

(ii) In order to prove the second claim, we let s € Q, where Q C Hy is acompact 
subset not containing any pole of a,,(y, s), and we decompose a;,(y, 5) as in (36). 
Choosing now d’ € N with d’ > 4N + 3 and applying the bounds obtained in 
Lemma 6.5 (iii) (note that Re(s) > 1/2 —n — d’/2 forn = 0,..., N) and Lemma 
6.6 (ii) (note that Re(s) + 2n > —2(N —n)—1 forn = 0,..., N) to the finite 
double sum in (36) and the bound (35) to the remaining series in (36), we obtain the 
estimate 


N n 
|am(y.8)| K Do |Vim(s + 2n)| > [Lm (y, 8: ki. — ky)| + |m|-" 
n=0 ki=0 


N 
<K So mPN*? « ||" + in «& ||“ PN), 
n=0 
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where the implied constants depend on z, 2, d’, and N, but are independent of m. 
Setting d := d' —2N —2 and observing that d > 2N + 1, the proof of part (ii) is 
complete. 

(111) As in the proof of (ii), we work from the decomposition (36). We let 5 be a 
pole of a»,(y,s) with § = —2N + 1/2, ie., 5 € Sy. As before, choosing d’ € N 
with d’ > 4N + 3, the bounds obtained in Lemmas 6.5 (iii) and 6.6 (iii) give the 
estimate 


N n 
tes, (,,)| < > ese F 2n)| ba el ¥, Siki,n —k,)| 


n=0 k,=0 


N 
yi ml <a, 


n=0 


where the implied constants depend on z, 5, d, and N, but are independent of m. 
Setting d := d'—2N and observing that d > 2N + 3, the proof of part (iii) is also 
complete. Oo 


Remark 6.9. By means of Remark 5.5, one can establish the meromorphic con- 
tinuation of ay,(y,s) (m ¥ 0) to the whole s-plane in the more general case 
Im(z) 4 Im(y7!i) for any y € I" by applying Lemma 6.5 (noting that this lemma 
also holds for y > 0 andm € R,m # 0) as well as by using the same techniques as 
in Lemma 6.7 and Proposition 6.8 applied according to the modified situation. The 
poles of a,,(y,5) and their residues turn out to be same as in the case Im(z) > 1. 
Moreover, also the statements (ii) and (iii) of Proposition 6.8 generalize to the case 
Im(z) 4 Im(y7!i) for any y ET. 


Theorem 6.10. For z € H with Im(z) > 1, the elliptic Eisenstein series Ey (z, 5) 
has a meromorphic continuation to the whole s-plane with possible poles at s = 
sp —2N,s = 8; —2N, ands = —s; —-2N +1(N €N), where sy is a pole of 
D(s—1/2)Epar(i, 8), and s; = 1/2+it; witht; > Oands;(1—s;) = A; a discrete 
eigenvalue of Anyp. 


Proof. Let z € H with Im(z) > 1. For s € C with Re(s) > 1, we represent the 
elliptic Eisenstein series €.(z, 8) by its Fourier expansion 


EonlZ,8) = D> am(y,s)e(mx), (37) 


meZ 


where the coefficients a,,(y, 5) are explicitly given by Propositions 5.2 and 5.4 for 
m = Oandm # 0, respectively. By Propositions 6.3 and 6.8, the functions a» (y, S$) 
admit a meromorphic continuation to the whole s-plane. 

In order to prove the meromorphic continuation of €.(z, 5) to the whole s-plane, 
let s € Q, where Q C {s € C|Re(s) > —2N — 1} for some N € N is acompact 
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subset not containing any pole of a,,(y,5) for all mm € Z. Choosing d € N, 
d >2N + 2, we have by Proposition 6.8 (ii) the bound 


YE lam (y, sye(mx)| < D2 ||, 
meZ meZ 


m#z0 m#0 


where the implied constant depends on z, Q, d, and N, but is independent of m. 
Therefore, the Fourier expansion (37) converges absolutely and uniformly in Q. 
This proves that €.(z, s) is holomorphic in s € C away form the poles of a@,(y, 5) 
form € Z. 

Now let 5 € C be a pole of a,,(y, s) form # 0 as in Proposition 6.8 (i); then, 
Re(S) = —2N + 1/2 forsome N € N. Choosing d € N, d > 2N + 3, we estimate 
using Proposition 6.8 (iii), 


lim(s —5) D3 am(y,s)e(mx) K D> |ress=sam(y.s)| K Do ml, 
ia meZ meZ meZ 


m#0 m#0 m#z0 


where the implied constant depends on z, 5, d, and N, but is independent of m. 

In this way we obtain the meromorphic continuation of E.(z, 5) to the whole 
s-plane with possible poles at s = sp -2N,s = 5s; —2N,ands = —s; -2N +1 
(N €N). The poles at s = sp — 2N are contributed by ao(y, 5); here sp denotes a 
pole of P(s — 1/2)Epar(i, s). Oo 


Remark 6.11. Using Remark 6.9, one can establish the meromorphic continuation 
of Ez(z, 5) to the whole s-plane in the more general case Im(z) 4 Im(y~!/) for any 
y €T. The poles of €.(z, s) and their residues turn out the be same as in the case 
Im(z) > 1. 


Remark 6.12. The elliptic Eisenstein series €.(z, 5) has a simple pole ats = 1 
with residue 


Vo(s) ) 
T'(s/2+ 1/2) 
a 1, 
tT; VOlnyp (Fr) 


ress=1Eey (Z, 5) = ress=1do(y, $) = 2mress=| ( 
= mress=1Epar(i, 8) = 


here we used the decomposition (30) for ao(y, s) with N = 0. 
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Consequences of the Gross—Zagier formulae: 
Stability of average L-values, subconvexity, 
and non-vanishing mod p 


Philippe Michel and Dinakar Ramakrishnan 


In memory of Serge Lang 


Abstract Applying the celebrated results of Gross and Zagier for central values of 
L-series of holomorphic forms of prime level, we deduce an exact average formula 
for suitable twists of such L-values, with a relation to the class number of associated 
imaginary quadratic fieds, thereby strengthening a result of Duke. We also obtain a 
stability result, as well as subconvexity (in this setting), and certain non-vanishing 
assertions. 
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1 Introduction 


In this paper we investigate some consequences of the Gross—Zagier type formulae 
which were introduced by Gross and Zagier and then generalized in various 
directions by Hatcher, Zhang, Kudla, and others [1, 11, 12, 14,40]. Let us now recall 
these formulae in the classical context. Denote by K an imaginary quadratic field of 
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discriminant —D, associated quadratic character y_p = (—2), and ring of integers 
Ox. For any character V of the ideal class group Pic(Ox) of K, let gw be the 
associated weight one theta series attached to V on the upper half plane H given by 


gu(zZ) = > ru(m)q™, ¢g = exp(2m1z),z € H, 


m>0 


where, form > 1, 


ry(m)= >) Ya) 


N(a)=m 


and a C Ox ranges over the Ox-ideals of norm m. We will denote the trivial 
character of Pic(Ox) by |x. 

Now let f be a holomorphic new cusp form of level N coprime with D, trivial 
nebentypus and weight 2k: 


I@= > ea" 


m>1 


Depending on how the primes dividing N split in K, the Gross—Zagier formula 
expresses the central value at s = k (or the derivative there) of the Rankin—Selberg 
L-function 


Lis, f; W) = L(Qs — 2k +1, X-p) So am(f)ru(m)m~ 


m>1 


in terms of an intersection/height pairing of the f-isotypic component ey, s of a 
cycle ew living in some Hecke module M = M,.v: Denoting this pairing by (-, -) iv 
and the Petersson inner product on $2; (N) by 


(te) = f for@y* 
Yo(N) y 


where Yo(NV) denotes the open modular curve T9(V)\H, one has 


LOCK, fv 
age ee = (ew, ews) mM (1) 


(ff) 


for some constant cx,x > 0 and the order of derivative i = ix. is 0 or | (depending 
on the sign of the functional equation). Originally the formula was proven as follows 
(for i = 0): let Mox(N) (resp. S2,(N)) denote the space of holomorphic forms 
(resp. cusp forms) of weight 2k level N and trivial nebentypus. The map 


fr Lk, f,¥), 
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being linear on S2,(N), can be represented by a kernel f tb (f Gw) for some 
Gy € Mz,(N) (same for the first derivative). By the Rankin-Selberg theory 


dxdy 
2 


LAW =f (Oge@ Exi@yr— 


for a suitable holomorphic Eisenstein series E2,—, of weight 2k — 1. The determina- 
tion of Gy amounts to first taking the trace from level N’ = /cm(4, N) to N, and 
then computing the projection of gw (z) Eox—1(z) on M2x(N). This can be done, and 
one infers from the computation of the Fourier expansion of gw(z) £2—1(z) that the 
Fourier coefficients a,,(Gw) of Gy are relatively elementary expressions involving 
the arithmetical functions ry and variants thereof: see below for an example. One 
the other hand, using the theory of complex multiplication, Gross and Zagier, and 
subsequently other people, showed by an auxiliary computation that 


Gu(2) = ao(Gu) + D> (Tew, ew) ug”, 


m>1 


where 7; denotes the m-th Hecke operator acting on the module M. The final 
result then follows from a formal argument involving the multiplicity one theo- 
rem. The main observation underlying this paper is that the above computation 
provides formally an expression for the average of the central values L(k, f, W). 
Namely, if F2,(N) denotes the set of arithmetically normalized new forms, then 
{f/f f) 7} ex.) May be completed to an orthonormal basis of Sx (N). Then 
decomposing Gy along such an orthonormal basis, and taking the m-th Fourier 
coefficient in the above decomposition, one deduces, for any m > 1, 


L(k, fLv 
ee anf) = Am(Gw) + Aoia(m) + Axis(™), 


SEF (N) 


where Aoia(m), resp. Agis(™m), is the contribution from the old forms, resp. the 
Eisenstein series, of weight 2k and level N. In principle, the Eisenstein series 
contribution could be evaluated explicitly, while the old forms contribution could 
be computed by induction on N by following the same scheme, although there is 
an added complication of finding a suitable orthonormal basis. We shall consider 
here the nicest possible situation in which these additional contributions have a 
particularly simple expression, in fact where the old part vanishes! Therefore we 
obtain, by the first step of the proof of the Gross—Zagier type formulae, a simple 
expression for the first moment 


L(k, f.¥) 
ys ———a(f). 
fEFx(N) (ff) 


Let us now turn to a more specific statement. 
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For K = Q(V/—D), denote by Ox its ring of integers, by Pic(Ox) its ideal class 
group, 
h = hx = |Pic(Ox)|, 


its class number, and 
u= |OR/{+}I. 


Given any ideal class group character W, recall that we have set form > | 


ry(m) = > W(a); 


N(a)=m 


for YW = 1x the trivial character, we shall also denote r;, (m) by 


R(m) = riy(m) = 1. 
aCOx 
N(a)=m 


We extend the definition of ry(m) to m = 0 by setting 


ry(0) = 4° ifW A lx 
Ww — 
h/2u, if V = 1x. 


We also set 


oy(m) = d. 
d\m 
(d,N)=1 
Specializing the formulas of Gross—Zagier and Gross [11, 12], and averaging over 
newforms /f, we obtain 


Theorem 1. Let —D < 0 be an odd fundamental discriminant; let N be a prime 
which is inert in K = Q(V—D) and let k = 1 be an integer. For V a character of 
Pic(Ox), and for any positive integer m, we have the following exact identity: 


90k = 2D Fw Lk, f,¥) 


(any cep om) 


SEF x(N) 


mD 


9: N 
on(m) + um!ry(mD)h am 2 ®,(n,V,N). (2) 


= Od 


n=1 


Here 


O(n, VN) = d((n, D))5(V)R(n)rv(mD — nN) P11 — oy? 
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with Py, denoting the (k — 1)-th Legendre polynomial; d(n) is the number of 
divisors of n; 6 € {0,1} is 1 if and only if (k, V) = (1, 1K); 6:(Y) € {0, 1} is 1 if 
D is prime, and when D is composite, it is | if and only if V2 = 1x and there exist 
ideals a,b, of respective norms mD —NnN and n, such that, for a prime ideal q of 
norm q congruent to —N mod D, the class of abq is a square in Pic(Ox). 


An asymptotic formula involving the average on the left was first established for 
k = 1, WV = 1x by W. Duke ((6]), which spurred a lot of other work, including that 
of Iwaniec and Sarnak ([19]) relating it to the problem of Siegel zeros for L(s, y_p). 
In the work of the second named author with J. Rogawski ((32]), a different proof 
of Duke’s result was given (for all weights), using Jacquet’s relative trace formula 
involving the integration of the kernel over the square of the split torus, and in 
addition, the intervening measure was identified. 

It is important to note that one obtains a stability theorem when N is sufficiently 
large compared with D and m, and this could perhaps be considered the most 
unexpected consequence of our approach. Indeed, when N > mD, the sum on 
the far right of the identity furnished by Theorem | becomes zero, and our exact 
average simplifies as follows: 


Corollary 1 (Stability). With the above notations and assumptions, suppose, more- 
over, that N > mD. Then one has 


20k = 21D ee Lk $Y) (/) 
(47) f€ FRAN) ee) 
2 
= —$ pon Gn) + um*pry(mD)h. 


We call the range N >mD the stable range. As one can check with other 
instances of the Gross—Zagier formulas, such as for the derivative in the case of 
odd order of vanishing, this phenomenon appears to be quite general. It has been 
recently generalized to Hilbert modular forms of square-free level by B. Feigon and 
D. Whitehouse ([9]), using the relative trace formula, now by integrating the kernel 
over a non-split torus; they are also able to treat more general characters W. 


When W = Ix, we have the factorization 


Lis, f 1x) = Lis, tk) = Lis, J)L(s, f &® X-p), 


where fx denotes the base change of f to K, L(s, f) the Hecke L-function of f, 
and f @ y~—p the twist of f by yp. Thus form = 1 and N > D, we get the 
following explicit identity involving the class number of K: 


2(2k —2)!D"/2u Lk, f)L(k, f ® x-v) _ k (: _3 12h ) 
(420)? (Af) 7 u(N —1))° 


SEF x (N) 
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In the weight 2 case, as N is taken to be a prime here, the cardinality of F2(N) is 
just the genus go(NV) of the compactification Xo(NV) of Yo(V). It is amusing to note 
that when go(V) is zero one finds that 


_ (V- Iu 
~ 42? 


h 


implying that h = 1 when (—D, N) is (—3,5), (—7, 13), (—8, 13) or (—11, 13), 
agreeing with known data. Similarly, Xo(11) is an elliptic curve E/Q, and if we 
denote by E_p the —D-twist of E, we see, for D = 3, that the algebraic special 
value A(1, £)A(1, E_3) is just 1/5. (We recall that the algebraic special value 
A(1, E) of an elliptic curve E over Q is the ratio of L(1, E) by the real fundamental 
period of E.) In general one gets more complicated identities, involving average 
central values, which are all compatible with the Birch and Swinnerton-Dyer 
conjecture for E, E_p, and the Shafarevich—Tate groups Sh(£), Sh(E_p). 


1.1 Application to the subconvexity problem 


We will now discuss some simple applications of the above exact average formula, 
the first one being a subconvex estimate for the central values L(k, f, V). We refer 
to [20] for a general discussion on the subconvexity problem. By the work of 
Waldspurger, the central value L(k, f, V) is non-negative and the convexity bound 
is given by 

0< Ltk, f. V) «. (KND)*kN'?D'?, 


for any ¢ > 0. We prove here. 


Corollary 2 (Subconvexity). Preserve the notations of Theorem 1. Then for any 
€ > 0, we have 


1 N'/2 
N}1/2 = DW2"° 


L(k, f. VW) Xe (KDN)°KN'? D'/?( 
In particular this improves on convexity as long as 
(kD)’ < N < D(kD)~* 


for some fixed 6 > 0. 


Note that this breaks convexity when JN is in a suitable range depending on k, D 
in which case the bound is subconvex in all parameters: such bounds are called 
hybrid. We refer to [29] of other hybrid subconvex bounds valid in quite general 
circumstances. The present bound however does not seem to be implied by [29]. 
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At this point we do not know of any application of these subconvex estimates, 
but we are intrigued by them because they come for free and seem to be hard to 
prove with the current methods of analytic number theory (e.g., see [5,24]). Note 
also that such bounds are fundamentally limited to the critical center s = k. Fora 
generalization to the Hilbert modular case, where W is allowed to be any ray class 
character, see [9]. 


1.2 Application to non-vanishing problems 


Another line of application addresses the existence of f for which L(k, f, V) does 
not vanish. Indeed several variants of such problems have been considered in the 
past by various methods [6, 19,23, 30,37]. Here we obtain non-vanishing results that 
are valid with a fairly large uniformity in the parameters, and again such uniformity 
seems hard to achieve by purely analytic methods. 


Theorem 2. Preserve the hypotheses of Theorem 1. Suppose further that 
Looe 


for some 6 > 0. Then there exists f € F2(N) such that 


L(k, f, V) £0. 
The same conclusion holds as long as N > D and eitherk £1 or UV # 1x. 


When WY = Ix, we also obtain a non-vanishing result in a somewhat greater 
range: 


Theorem 3. Suppose WV = 1x, k = 1 and 


N-1 
1 © 


h< 


Then there exist f such that 


Lk, fyL(k, f ® x-p) #9. 


Non-vanishing theorems of this kind, with an explicit dependence between N 
and D (like N > D or N — 1 > 12h), are of some interest. For instance, in the 
paper [27], Merel needs to consider the following problem: Given a prime p and 
a character y of conductor p which is not even and quadratic, does there exist an 
Ff € Fa(p) such that L(, f ® x) # 0? In the appendix of that paper, the first 
named author and E. Kowalski prove that this is the case when p is greater than 
an explicit but very large number. In particular, it has so far not been possible to 
answer the problem numerically in the finitely many remaining cases; this has been 
answered, however, for p < 1000 [26]. 
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Closer to the main concern of the present paper, Ellenberg [7,8] uses analytic 
methods to prove the non-vanishing of the twisted L-function L(1, f ® y~4) for 
some f in F2(N) for N of the form p? or 2p? (p an odd prime) and with prescribed 
eigenvalues at the Atkin—Lehner operators w2, w,, subject to an explicit lower bound 
on p. Ellenberg concludes from this the non-existence of primitive integral solutions 
to the generalized Fermat equation A* + B? = C? as long as p > 211; that this 
equation has only a finite number of primitive solutions is a theorem of Darmon and 
Granville. (Since this article was written, there has been further progress in “The 
Diophantine equation A* + 2¢ B? = C”” by M.A. Bennett, J. Ellenberg, and N.C. 
Ng, to appear in Int. J. Number Theory.) Another related set of examples is in the 
work of Dieulefait and Urroz ([4]). In a sequel to this paper under preparation ([28]), 
we will develop a suitable generalization of the exact average formula to a class of 
composite levels N, and investigate similar questions by modifying the method. 
This extension is subtle for three reasons: N is not square-free, D is not odd, and 
N, D are not relatively prime. 


1.3 Nonvanishing modulo p 


The exactness of the Gross—Zagier formulae even enable us to obtain average non- 
vanishing results for the algebraic part of the L(k, f, V) modulo suitable primes p. 
Again, such a question has been considered in the past, see for example [2, 37]. 
However, these earlier works addressed the question of the existence of the non- 
vanishing of L(k, f, Y) mod p when the form /f is fixed and when the character W 
varies. Here our results go in the other direction as we fix p and let N and f vary. 
Given f € Fo,(N) and gy as above, we denote by L*2(k, f, V) the algebraic part 
of L(k, f, V) (see Section 6, (11), for a precise definition). It follows from the work 
of Shimura that L"!®(k, f, W) is an algebraic number satisfying the reciprocity law 


L8(k, f, W)° = L®®(k, f°, W") 


for any o automorphism of C [33]. 


Theorem 4. Let p > 2k +1 be aprime, let P be a chosen place in Q above p, and 
let N, D be as in Theorem 1. Suppose, moreover, that p does not divide h = h_p, 
that N > D, and that N is greater than some absolute constant. Then there exists 


Ft € Fox(N) such that 
L"8(k, f, VW) £0 (mod P). 
The question of the integrality of L*®(k, f, W) is quite subtle, and our result only 


concerns the numerator of the L-value. When V = 1x, we also prove the following 
variant: 
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Theorem 5. Notations and assumptions as in Theorem 4. Suppose moreover that 
W = 1andN > pD. Then there exists f € Fox(N) such that 


VB 2) LE: f)L(k, f ® x-p) 


(ff) a,(f) £0 (mod P*!), 


The assertion makes sense because the left-hand side is (see Section 6.1) a p-unit 
times a,(f) times L"8(k, f, 1x). 


There are two fundamental periods ct (f) and c~ (f) associated to f such that 
for any Dirichlet character v, the special value L"8(k, f @-v), defined as L(k, 
f ® v)/c8"—))( f) times a simple factor (see Section 6, (12)) is an algebraic 
number. One gets the near-factorization 


np L8(k, fix) = L°(k, f)L"8(k, f ® x-p), 


where nf is essentially the order of the congruence module considered by Hida, 
Wiles, Taylor, Flach, Diamond, and others, which measures the congruences f has 
with other modular forms modulo p. The needed non-divisibility properties of 7 ¢ 
(for suitable p~) are understood (at least) if f is ordinary or k = 1. Now finally, let 
us suppose we are in the classical weight 2 situation, ie., with VW = 1x andk = 1. 


Theorem 6. Let p an odd prime not dividing Dh_p, with D odd. Then there exist 
infinitely many newforms of f of prime level N and weight 2 such that 


= f @ xv) 
num 
nf 


) # 0 (mod p), 


where 1) ¢ is the order of the congruence module of f. 


See Section 6 for a discussion of 1, which measures the congruences that f 
may have with other modular forms of the same weight and level. An analogue of 
Theorem 6 should also hold in a suitable range of p, for forms of higher weight, and 
this question will be taken up elsewhere. 


1.4 Acknowledgements 


Serge Lang always conveyed infectious excitement about mathematics to anyone 
he came into contact with, and he will be missed. He was quite interested in the 
values of L-functions and in the divisibility properties of arithmetic invariants, and 
it is a pleasure to dedicate this article to him. We would like to thank B. Gross for 
discussions concerning the occurrence of a factorial in Theorems 5.5, 5.6 of [11], 
and also M. Flach, H. Hida, K. Prasanna, V. Vatsal, and D. Whitehouse for helpful 
conversations concerning parts of the paper. Thanks are also due to Paul Nelson and 
the anonymous referee for reading the paper carefully and finding a number of typos 
and small inaccuracies. 


446 P. Michel and D. Ramakrishnan 


The first author would like to thank Caltech for its hospitality during the 
preparation of this work, and acknowledge the partial support he received from 
the ERC advanced research grant no. 228304. The second author was partially 
supported by the National Science Foundation through the grants DMS-0402044 
and DMS-0701089. 


2 The weight 2 case 


It may be instructive to explain why the exact average formula holds in the weight 
2 case when Y = 1. Let B be a quaternion division algebra over Q, ramified only at 
(a prime) N and oo, with maximal order O. Let Y be the associated rational curve 
such that Aut(Y) = B*/Q*. Put 


XSB VY xB Oo" SU" 5 


T,\Y, 


where BX = [[’B; and OX = [[ O%, with each I; being a finite group. Then 
Pp 


Pp 

Pic(X) is isomorphic to Z" with natural basis {e),@2,...,@n}, where each e; is 
the class of [';\Y. Since N is inert in K = Q(/—D), there is an embedding 
Ff € Hom(K,B) = Y(K). It results in certain Heegner points, 


x = BX\BX(f,b) eX 


of discriminant —D, with b € B* / O*. The set of these Heegner points is acted on 
transitively by Pic(Ox), and for A € Pic(Ox) and x a Heegner point, we denote by 
x, the corresponding translate; let 


>) as 
A 


for A running over ideal classes of K. For any weight 2 eigenform /, let c¢ denote 
the f-component c. Then by a beautiful theorem of B. Gross ({12, Prop.11.2]), 
providing an analogue for the L-value of the Gross-Zagier theorem for the first 
derivative, one has 


w/D LU, f)LU, f ® x-p) 
817 (Tid) 


where (-,-) x is a natural height pairing on Pic(X ). We have by orthogonality, 


(cp.cf)x = 


(c, Tmc)x = (cz, Tmce)x + ) {c¢, Tme sf) x, 
r 
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where T,,, is the operator corresponding to the m-th Hecke operator on M2(N), f 
runs over newforms in M)(N), and £ denotes the unique (holomorphic) Eisenstein 
series (of weight 2 and level V). Using the fact that f and E are Hecke eigenforms, 


and that (cz,cz)y = a (cf. [12, (11.7)]), we get, by averaging Gross’s formula, 


JD LO, ALA. f @ xv) _ he 
S72 dX (7, f) on (m)— + (ec, TC) x 
One has 
(c, Tinc) x =a XB, TinXAB)x 
A B 
and ([{12, Prop. 10.8]) 
mD/N 
Yo (xB. TnXaB)x = uhryg(m) + > ra(mD —nN)d((n, D))Ry_way (0); 
B n=1 


here r4(n) is the number of Ox-ideals of norm n in the class A and Ry_y4}(1) 
is a variant of r4(n) whose definition is recalled in (3) below; when D is prime, 
Ry_ ya} (1) is just r4(n). 
The assertion of Theorem | now follows by summing over A € Pic(Ox). 
Moreover, when m D is less than N, S°(xg, TinXag)x simply equals uh R4(m), and 
B 


this furnishes Corollary 1 (stability) in the weight 2 case. 


3 Proof of the main identity for all 2k > 2 
3.1 Preliminaries 


For N > 1, let Mo,(N) (resp S24 (N)) denote, as usual, the space of holomorphic 
modular forms (resp. cuspforms) of weight 2k, level N and trivial character. For 
ff € Mx(N), we write the Fourier expansion at the infinite cusp as 


S() = ss Gal tq" .@ = expCxiz). 


m>=0 


We denote by Fx, (NV), the set of cuspidal newforms f (normalized in the usual way, 
so that the first Fourier coefficient a;(f) is 1.) Whenever it converges, we denote 
the Petersson inner product on Mo; (N) by 


oe / for@y*# 2” 
Yo(N) oy, 
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With notations as in the introduction, define, for any ideal class A € Pic(Ox), 


{a C Ox, N(a)=m,ae A}| ifm>1 
1 


r4a(m) = 
‘ ifm = 0. 


Qu 


We also need a slight variant Ry_ v4} (7m) defined as follows: given g a prime such 
that g = —N(mod D), when q splits and writing qOx = qq, we set 


Re_-way(n) = {7 : NZ) = 1, qAT € Pic(Ox)}]. (3) 
Observe that this definition is independent of the choice of g and that when D is 


prime, Ry_va}(72) is just R4(n). 
The theta series 


84(2) = D> ra(m)q”, q = exp(2z12) 


m>0 


is a modular form of weight 1, level D and central character y_p. Moreover, for 
any WV € Pic(Ox), put 


Ou(z) = D> ¥(A)Oa(2), 
A 
whose Fourier coefficients are then given by 


dm (Ov) = S> W(A)ra(m) = rg(m) = ru(m). 
A 


In particular, the constant term ao(@y) equals - >>, YA), which is, by orthogo- 
nality, zero if and only if Y ~ 1x, when Oy is a cusp form. Setting 


Lis, f,A) = LC +208 -&), 4-0) — 
m>1 
with 


LO)(s, yp) X—p(n) 


ns 


n>1 
(n,N)=1 
one has 


Lis. f¥)= > W(A)L(s, fA). 


AeéPic(OxK) 
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Define a holomorphic function G4 on the upper half plane 1, invariant under z > 
z+ 1, by means of its Fourier expansion at infinity: 


Gi@y= Yona”, (4) 
m=0 
where 
h 
bm.a = m*~'—ra(mD) (5) 
u 
mD/N 
a 2nN 
oe 2 5(n)ra(mD —nN)Re_way(n) Pr-1 (1 = ) : 


n=1 


In this definition, wu and R(n) = }°,r4(”) are as in the introduction, Ry_way as 
in (3), 6(”) is 1 (resp. 2) if (m, D) = 1 (resp. ¥ 1), and for r > 0, P, is the r-th 
Legendre polynomial defined by 


[r/2] 
1 y m{T 2r—2m r—2m 
P,(x) = a (-1) () ( ” )x 2 


m=1 


The following result, due to Gross and Zagier, is crucial here: 


Theorem 7. Gy is a modular form of weight 2k, level N, and trivial character; it 
is cuspidal if k > 1, and for every newform f of weight 2k and level N, we have 


(4x)** 


Nei 2(2k — 2)!D'/2 


(f, Ga). 


For k = 1, see [12, p.291, Prop. 9.1], and for general k, this is in [11, Thm (5.6), 
p. 291]. 


3.2. Correction of a small error in [11], Theorem 5.6 


We take this opportunity to correct a small error in [11]: The factor (A — 1)! should 
not be present in the numerator of the right-hand side of the second formula of 
Theorem (5.6) on p. 291 of [11]; it is “canceled” by the (k — 1)! term in the 
denominator on the right-hand side of the formula in Prop. (4.4) on p. 283 of loc. 
cit. This does not affect anything concerning the weight 2 case (when k = 1), which 
was the main thrust of [11]. Even for general k, it is of little consequence when k 
is fixed. It is however crucial for us, especially since we would like to study the 
behavior on average when k becomes large, and so we give a detailed discussion 
here. 
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To begin, in the statement of Theorem 5.5 (on page 290 of [11]), the formula 
should read 


1)" Gay" 92k-1 e(N)r! 
(Qk—2—r)! (2k —2)! |DPEr2 


x (FY Brg”), () 


La(f2k-1-r)= 


with 
m|D|/N 
bnr = YS) ra(m|D|—nN)Px-(Nn,m|D|)ox-2,-2,A (0). 


n=0 


The only difference from the formula of Gross—Zagier is the appearance of (2k — 
2—r)! in the denominator, instead of what they have in [11], namely (2k —2—2r)!. 

To see that (*) is the right formula, note that by Proposition 1.2 of [11], which is 
used in the proof (see p. 272), 


(4) ** NST (5 + 2k — 1) LAC 2k —1—r) = (f, Os). 


To get Theorem 5.5 (of [11]), Gross and Zagier apply their Proposition 1.2 with 
s = —r, and so, using P(2k —1—r) = (2k —2—r)!, one obtains 


_ (42)?k-!—" Nv 2 
LAU fi2k =l=r) = Qk -2=ny fh P~): (#*) 


By Corollary 3.4 on page 281 (of loc. cit.), 


es) 
_,(z) = », tingtye™, 


m=0 
where 
mé/N 
Ams(y) = > ens (y)ra(mé — nN), 
n=0 


with e,,(y) being given by the formulae on the top of page 282. 
When r = 0, the form ®_,(z) is holomorphic. For k > 1 andr > 0, we may 
apply Proposition 5.1 on page 288 (of [11]) to conclude that 


(f.0--) = (fOr), 


where ®_,. is the holomorphic projection given by 


CO 
@_,(z) = ae 


m=1 
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with 


(4am)! 


(2k — 2)! 


io) 
any = i am,r Qe ya: 
0 
Working through the calculation of Gross and Zagier on the bottom of page 289 
and the top of page 290, making use of Prop. 5.1, Cor. 3.4 and Prop. 4.4, we get 
exactly the same expression for the coefficients of ®_, as Gross and Zagier do on 
page 290, namely, 


(1) 2s rte 
(2k = 2)!N"|D |2k#-r-1/2 


ane = mrs 


with 
m|D|/N 
bmx = >) ra(m|D| —nN)Py(Nn, m|D|)or%-2-2,a(0), 
n=0 
where P;.-(x, y), resp. 02¢,4 (7), is given by (5.3), resp. (5.4), on page 290. 
It is important to pause and note that the expression for by, correctly involves 
O2k—-2r—2,A (1) (and not o24—;—2,4(1)). The source of this is that e,,() (occurring 


in the definition of ay,,(y)) is e_-(, MY) e2aNny/S (see the top of page 282 of [11]), 
and for n 4 0, we have from page 279, 


e,(n, y) _ if L (s)y *-***2Y, (ny), 


where 


3 Ep, (md2)ep, (n/méz) 


(mS y)25+2k—-2 


Lis) = => en,(-N) x0,0,(A) 


D=D,D2,D2|n m|n/52,m>0 


Thus L,(s) is a sum of Dirichlet series at the argument 2s + 2k — 2, and we are 
letting s = —r. 
In any case, from the expression above for ad», we get 


Uf . r _ Gl en yrie 
— (2k — 2)!N"| D/P? 1/2 


(> bao): 


Plugging this in (**), canceling N”, and consolidating the powers of 27, we get 
(*) as asserted. 

These calculations extend to the case r = k — 1, when we get a fortuitous 
cancelation as the r! in the numerator and the (2k — 2 — r)! in the denominator 
are the same in (*). If one had (2k — 2 — 2r)! in the denominator, it would become 
1 when r = k — 1, and this is why the formula in Theorem 5.6 (on page 291) has 
a spurious (kK — 1)! in the numerator. Again, this was not of major concern to Gross 
and Zagier as they were mainly considering k = 1 with a dramatic application to 


452 P. Michel and D. Ramakrishnan 


elliptic curves. It is amusing that it also doesn’t matter for the & = 2 example Gross 
and Zagier work out at the bottom of page 291. We care only because the appearance 
of the unwanted (A — 1)! in the numerator would contradict the general RH. 

It follows, after the simplifications Gross and Zagier indicate in [11], that 
Theorem 5.6 holds when the expression for L.4(f,k) is corrected to be the 
following: 


(Qor)* 27-1 


LAL) = GET pie 


(f > bm,Aq"), 


where 5,4 is as in the beginning of Theorem 5.6 of loc. cit. 


3.3 The exact average formula 


Let 
lo) 
E= Eon = So an(E)q" 


n=0 
denote a holomorphic Eisenstein series for [o(N) of weight 2. Since N is prime, 
the modular curve Yo() has only two cusps, namely oo and 0. It then follows that 
E is unique up to scalar multiple, and so E(z)/do(£) is well defined with constant 
term | at oo. To be specific, we will take 


£@ = XS) + Sovimy” 
y= 12 On\MN)q , 


m=1 


where oy (m) = abd Ni d. 
For A € Pic(Ox), with G4 being as in the previous section, put 


boa 


Ga? (2) = Ga(2) — Sea=1 ao(E) 


E(z). (6) 


Then G{;"” is a holomorphic cuspform of level N, weight 2k, and trivial character, 
with coefficients a,(Gy). 

Lemma 3.1. For —D an odd fundamental discriminant and N a prime inert in K, 
we have, for anym = 1, 


20k=2ip' L(k, f, A) 
(Airy >. (f f) an(f ) 


fEFx(N) 


cus) b , 
= an(Go) = bm,A — Bem amt). 


In order to prove this, we first need the following 
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Lemma 3.2. Assume that N is a prime that is inert in K = Q(/—D). Let f be 
any old form in So(N). Then we have, for every A € Pic(Ox), 


(f.Gy'") = 0. 


Such a lemma will not in general hold for composite NV. 


3.4 Proof of Lemma 3.2 


There is nothing to prove when k < 6, since S$2(1) is zero in that case. Suppose 
that k > 6; since f is cuspidal, it suffices to prove that (f, G4) = 0. Put 


Gu:= > W(A)Ga. 


Ae€Pic(Ox) 


It is sufficient to show that (f/Gw) = 0 for all ideal class characters W of K. If 


f = 721 an(S)q", put 


Dis, f x Bu) = L(+ 2(8— k). xv) Qs IMO) 


n=1 


Then the Rankin-Selberg method ([11, IV, §1]) gives the identity 
(40) *T(k)D{k, f x Ov) = (f. Tryon OvE2k—1.)) (8) 


where &,—,y is the result of slashing a holomorphic Eisenstein series of weight 
2k — 1 (and character y_p) with the Atkin involution wy, and Tryp;p denotes 
the trace from M>,(ND) to M>;.(N), where M,(M) denotes the space of non- 
holomorphic modular forms of level M and weight r. Moreover, the calculations 
of Gross and Zagier ([11, IV, §5]) show that Gy is the holomorphic projection of 
Tryp/n (OwE2x-1,n), SO that 


(f, Gu) = (f. Tenpyn (OwE2k-1,N))- 


Let f be a newform of level 1 (and weight 2k). Then since N is prime, it defines 
two old forms of level NV, namely f\(z) = f(z) and fy(z) = (Nz), so that 
Am(fn) is zero unless N |m, and damn (fv) = Gm(S). Since the new and old forms 
are orthogonal to each other under (-, -), and since the space of old forms of level V 
are spanned by { fu,d = 1, N} with f running over all the cuspforms of level 1, it 
suffices to prove that each D(k, fa x Ow) = 0. For d = 1, one has 


x-D(N) 


D(s, fi X Ov) = Cee eit sou): (9) 
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This reduces to checking the vanishing of the right-hand side. Since f has level 1, 
the root number of L(s, f x @y) is —1, yielding the requisite vanishing. When 
d=N, Dik, fo x Oy) is still a multiple of L(k, f x Ow) and is therefore zero. O 


3.5 Proof of Lemma 3.1 


We may choose an orthogonal basis B of S.,(N) to be of the form Fx, (N) U B’, 
where B’ consists of old forms. Clearly we have 


ae G,”) cusp 
Esa ee ee (10) 
Xu (ha) . 


In view of Lemma 3.2, the sum on the left-hand side needs to run only over 
newforms f. Applying Theorem 7, and using (10), we obtain 


2(2k —2)!D'/? L(k, fA as 
“ae an Pee 
FEF 2 (N) ; 
The lemma now follows by taking the m-th coefficient of the above identity. Oo 


3.6 Proof of Theorem 1 


The exact average formula follows by performing the averaging process 
» seric(ox) Y(A) --. on both sides of the formula in Lemma 3.1, using the formula 
(5) for the coefficients b,,.4, and by noting that 


An(E) 12 
a(E) N-1 


on (m) 


and that, when k = 1, bo. 4 = a O 


2u2* 


4 Subconvex Bounds 


In this section, we prove Corollary 2. By the work of Waldspurger, Guo and Jacquet 
[13,38] (see also [22] for V = 1x), 


L(k, f, V) = 0. 
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Thus from formula (2) form = 1, we have 


I0k=2)'1D'? LK, LY) “ hey 
(41)?k et) ~ # 


Since | P,—1(x)| < 1 for |x| < 1 and R(x), |ru(n)| < d(n), so that 


|O(n, ¥, NY]. 


R(n)|rw(D —nN)| < d(n)? + d(D — nN)’, 


we see that the n-sum on the right side is bounded by 2 (log D)?. From the class 
number formula, we have 


h< D'logD 
and 


(f. f) « (42) (2k — 1)!N (log N)P 


as follows from [21, (2.31)]. Unlike the corresponding bound for Maass forms 
([18]), this upper bound is elementary since f is holomorphic and by Deligne its 
Fourier coefficients satisfy the Ramanujan—Petersson bound. Thus we see that 


L({k, f. V) « (logkN log D)?k(N + py), 


Oo 
5 Application to non-vanishing 
We prove here Theorem 2. Arguing exactly as above we have 
2(2k — 2)!D1/? Lik, f.¥) ih h/u)? D 
(2k ~ 2) KAW) hb  s6Ch/W? | 6 (D agg py: 
(4) Gf) «  N-1 N 


SEF x (N) 


h D 3 
= a O (joe) ). 


By Siegel’s theorem, which gives h = D!/?+°), we see that the right side is 
positive as soon as N > D!/?+5 for some 5 > 0. If N > D, then we are in the 
stable range and we have 


= / 
2(2k — 2)1D1? BELO AF (-) ay 


(47) (ff) 4 


SEF x (N) 


When 6 = 0, this concludes the proof of Theorem 2 since h > 1. Oo 
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Suppose now that 6 = 1 (1.e.,k = 1, YW = 1x). Then we remark that 


M I 


®,(n, 1,N) = 0, 
n=1 

so that 

DN Lk, f¥) Sh (: a wey 

Bm” rerun) (PF) ~ om ae 
completing the proof of Theorem 3. O 
6 Non-vanishing mod p 
6.1 Algebraic Parts of L-values 
Let us put 

L'8(k, f.U) = Chany he DF eo) EO, (12) 


where g(¥—p) = )°x(moap) X—D (x) exp(2zi 7) is the Gauss sum. Then it is known 


by Shimura ({33], see also [16]), that L*8(k, f, y) is an algebraic number obeying 
the reciprocity law: 


LE fy U) = LER LU, 


for every automorphism o of C. 

Next recall that for ¥ = 1x, we have the factorization L(k, f, VY) = L(k, f) 
L(k, f ® y~p). For any Dirichlet character v, the algebraic part of L(k, f ® v) is 
given by 

L({k, f @v) 
(—2mi)¥e+(f) 
where ci(/) is a fundamental period of f, with + = v(—1). Again, one has for 
any automorphism o of C, L*£(k, f? @ v’) is L*8(k, f @ v)°. 

This leads to the near-factorization 


Li8(k, f @v) = g(v)(k —1)! (13) 


nL Pk; f.1x) = L™(&, SL, 7 @ xn); (14) 


where 7 ¢ equals — thanks to a series of papers of Hida (cf. [16], [17]), Wiles ([39]), 
Taylor—Wiles ((35]), and Diamond—Flach—Guo ([3]) — the order of the congruence 
module of f, i.e., the number which counts the congruences of f with other 
modular forms of the same weight and level. 
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6.2 Proof of Theorems 4 and 5 


From the definition of the algebraic part, the hypothesis of Theorem 4 and the 
formula (11), used in conjunction with 6 = 0, we have (up to multiplication by 
a p-unit) 


Yo Lek, f¥) = - 
SEF ox(N) 


The conclusion of Theorem 4 is immediate. 
For the proof of Theorem 5, we have, assuming that N > pD, 


ey. 


tie z(t — 81 


SEF x (N) 


Therefore the conclusion holds except possibly if p|(1 — 120M) Suppose we are 
in that latter case. Then we apply the exact formula of Corollary 1 with m = p and 
get 


S2 Lk, flap) = 7 (R@)- Ol) p +0). 


SEF x (N) 


R(p) is either 0 or 2, if it is zero, then the left-hand side of the previous formula 


is not divisible by p. If R(p) = 2, then 2 — volo ” is not divisible by p since by 
12¢h 


assumption p|(1 — i ae). So we are done in al cases. O 


6.3 Proof of Theorem 6 


Here we are restricting to the weight 2 case, and by the theory of modular symbols, 
cf. Stevens [34] and Vatsal [36] (see also Prasanna [31]), we know that for any 
Dirichlet character v, the special value L®(1, f @ v) is integral except possibly at 
the Eisenstein primes; these are the primes dividing 


N :=|[¢@-D, 


q|N 


which is related to the order of the cuspidal divisor class group, studied for modular 
curves, among others, by Kubert and Lang. 

We may, and we will, choose N to lie in the infinite family of primes that are 
inert in K and are such that p + N. 

Now Theorem 6 follows by the near-factorization (14) of L*2(1, f, 1x). It may 
be useful to note that when f has Q-coefficients, with associated elliptic curve E 
over Q, one knows (cf. Flach [10]) that any prime dividing np also divides the 
degree of the modular parametrization Xo(NV) > E. 
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Abstract In 1976, Serge Lang and Hale Trotter formulated general conjectures 
about the value distribution of traces of Frobenius automorphisms acting on an 
elliptic curve. In this paper, we study a modular analog. More precisely, we consider 
the distribution of values of Fourier coefficients of Hecke eigenforms of weight 
k>4. 
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1 Introduction 


Let E be an elliptic curve over a number field K. If p is a prime of Ox and E has 
good reduction at p, denote by ay(£) the integer 


Np+1—-|E(F,)|. 
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In 1976, Lang and Trotter [4] formulated some conjectures about how often dp(F) 
takes a fixed value. More precisely, they conjectured that there is a constant Cr q 
(possibly zero) such that for x — oo, 


Te a(x) := #p: Np < xanda,(E) =a} ~ a 
log x 
provided we are in the generic case, that is, a # 0 or E does not have complex 
multiplication. The constant cz, depends on the Galois representation attached to 
E.1n 1981, Serre [13] proved that for any € > 0, 


Tea(x) _ x/(log ayer, 


in the generic case. The exponent 5/4 was improved to 2 by Daqing Wan [17]. A 
further refinement was obtained by the second author in [5] where it is shown that 
x (log log x)? 

4 a xX <K a Pa aa 
zal) (log x)? 
The case dp(£) = O corresponds to EF having supersingular reduction at p. A 
classical result of Deuring shows that if E has complex multiplication by an order 
in an imaginary quadratic field F’,, the set of supersingular primes of K has density 
1/2 if F is not contained in K and zero if F C K. If E does not have complex 
multiplication, then Elkies, Kaneko, and R. Murty (see [1]) showed that 


me o(x) « x*!4, 


Recently, R. Taylor has announced the meromorphic continuation of symmetric 
power L-series attached to F (in the case that K is totally real and FE has 
multiplicative reduction at some prime p). It is conjectured that these symmetric 
power L-functions extend to entire functions. If we assume this, together with an 
analogue of the Riemann hypothesis for them, K. Murty [6] has shown that 


TE a(x) « x*/4 


ifa # Oor E does not have CM. A substantial generalization and reinterpretation of 
the Lang—Trotter conjecture can be found in [7], where a more general formulation 
in terms of Galois representations is made. 

In this paper, we consider a normalized Hecke eigenform of weight k > 4 for the 
full modular group. We write 


f(2 _ soe 
n=1 


for its Fourier expansion at ico. The field K ¢ generated by the values A (1) as n 
ranges over all positive integers is of finite degree over Q. We write O for the ring 
of integers of K ¢. In an earlier paper [9], we showed that if a € O¢ is coprime to 2, 
then the number of solutions of the equation 


Af(n) =a (1) 
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is bounded. Moreover, there is an effectively computable constant c = c(a) > 0 
such that all solutions 7 of the equation satisfy 


n < exp(|N(@)|*), 


where N(q) is the norm of a from K ¢ to Q. This means that for any given q, all the 
solutions of (1) can be effectively determined. If, in addition, we assume the abc 
conjecture for the number field K , then it was shown that the exponential bound 
can be improved to a polynomial bound of the form c;|N(a)|°, for some constant 
c, > 0 and the same c as before. In the special case of the Ramanujan t-function, 
we deduced that the number of solutions of the equation t(n) = a with a odd is 
finite, a result obtained earlier in our joint work with Shorey [11]. Our methods are 
sufficiently versatile to be applied to related problems. For example, in [10], we 
study the greatest prime ideal factor of the ideal generated by A /(p”) for fixed p 
and varying using similar techniques. 
In this paper, we want to study the number v (a) of solutions of the equation 


INAs‘(™))| =a 


for a given natural number a. We prove the following Theorem: 


Theorem 1 Let f be a normalized Hecke eigenform of weight k > 4 for the full 
modular group. Assume the abc conjecture for K ¢. Let d = [Kr : Q]. Then, for 
any € > 0, 


/ 
2 v¢(a) < x2/dk-3) te 


a<x 


where the dash on the summation indicates that we sum over odd, positive a. 
We immediately deduce the following corollary: 


Corollary 2 For any normalized Hecke eigenform f of weight k > 4 for the full 
modular group, 


ve (a) < q7/dk—3) +e ; 
provided a is odd and the abc conjecture holds for K ;. 


What is interesting about this corollary is that it is consistent with the Atkin—Serre 
conjecture (see p.244 of [14]). This conjecture predicts that if f is of weight k > 4 
and is not of CM type, then for sufficiently large primes p, 


|A¢(p)| > pe OP, (2) 


As (2) is conjectured to hold for all conjugates f° of f, it implies that 


d(k 


INQ ¢(p))| >> p= 
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and so 


vp(a) < |al7Te, 

As was shown in [9], A ¢(p) is divisible by 2 for all odd primes p in the level-one 
case. This is a key fact, since it implies that for w coprime to 2, the equation A (m) = 
a forces n to be a perfect square (see [9]). Thus, Theorem | can be extended to 
higher levels, provided this property holds for all sufficiently large primes. Indeed, 
Ono and Taguchi [12] have shown that this is the case for all forms of level 2“ No 
with a arbitrary and No = 1,3,5,15, or 17. We record this observation in the 
following. 


Theorem 3 Let f be a normalized Hecke eigenform of weight k > 4 and level N. 
Suppose that for all primes sufficiently large, A ¢(p) is divisible by 2. Assuming the 
abc conjecture for K ¢, we have for any € > 0, 


/ 
> vp (a) <K x2/dk—3) te | 
asx 
where the dash on the summation indicates that we sum over a coprime to 2 and 


d =([Ky : Q. 


Acknowledgements We would like to thank the referee for useful comments on an earlier version 
of this paper. 


2 Preliminaries 


We begin by reviewing results proved in an earlier paper [9]. 


Proposition 4 Let f be a normalized cuspidal eigenform of weight k > 4 and 
level N. There is an effectively computable constant c, > 0 such that form => 2 and 
every prime p, we have 


Kl (m—cy log m) 


m k 
Ar (p")| = lvp (p.m) |p ? 
where y¢(p,m) = 1 ifm is even and d ¢(p) if m is odd. 
Proof. This is Proposition 2.2 of [9]. oO 


In particular, we see from this proposition that A ¢(p’") 4 0 when m is even and 
sufficiently large. 


Proposition 5 Let f be a Hecke eigenform of weight k and level N. Then, for all 
Pp sufficiently large, either )¢(p) = 0 or A¢(p*) F 0 for all a > 1. Moreover, 
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for each m, there is a binary form fn of degree [m/2], with integeral coefficients 
such that 


Aye") = Ap irre) Pp’): 


Proof. The first part of the assertion follows from the previous proposition or from 
Lemma 2.3 of [9]. The second part follows from the proof of the same lemma. The 
binary form fin (x, y) is 


[m/2] 
I] (x — 4y cos?(xr/(m + 1))), 


r=1 


which is easily seen to have integer coefficients by simple field-theoretic considera- 
tions. Oo 


We will also have need of a version of Roth’s theorem, which we record in the 
following lemma. 


Lemma 6 (Roth’s theorem) Let f be a binary form with integer coefficients and 
degree d > 3. If f has distinct irrational roots, then, 


| f(x, y)| >> max(|x|, ly|)*-2-«, 
where the implied constant depends only on the coefficients of f . 


Proof. This essentially follows from Roth’s theorem. See also [8]. oO 


A number-field version of this lemma will also be needed in the later sections, 
and this will be recalled in Section 4. 

Our line of argument has its origins in [9] and [11]. In [11], it was observed that 
the Ramanujan t-function has the fortuitous property that t(p) is even for every 
prime p. By an analogue of Proposition 5 for the t-function, we see that t(p””) is 
even for every odd m. Hence, if we are interested in the equation 


t(n)=a 


for a odd, it follows that 1 must be a perfect square, by virtue of the multiplicativity 
of t. This was the key fact that enabled the application of results from Baker’s 
theory to establish that the number of solutions to the equation t(n) = a, with a 
odd, is finite. This argument was extended to any normalized eigenform for the full 
modular group in [9]. As indicated in [9], results of Tate [15] imply that A ¢(p) is 
divisible by 2 for every prime p. This enabled us to extend the results of [11] to the 
full modular case. As indicated in [9], the method can be generalized to arbitrary 
level provided that A (p) is divisible by 2 for all primes p sufficiently large. With 
this background information in place, we now outline our basic strategy. 
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We fix a positive integer a coprime to 2 and study the equation 
|N(A ¢())| = a. 


As A ¢(n) is multiplicative, we see that A ¢(p’’) is coprime to 2 for p’”||n. Now 
suppose that A ¢(p) is divisible by 2 for all primes p > co. Then by Proposition 5, 
we see that 1 ¢(p’”) is divisible by 2 for all odd m and p > co. Thus, if we write 
nN = NoN\N2, where the prime factors of n; are < co satisfying A¢(p) A 0, the 
prime factors p of no are < co with A¢(p) = 0, and the prime factors of nz are 
> co, then we see that 1 is a perfect square. For primes p|n;, we have p < co and 
A ¢(p) # 0, so that Proposition 4 shows that 


lA ¢(p")| = ly¢(p.m)|p 2 


This means that 1; is bounded, since the primes and prime powers that divide it are 
bounded. If we look at no, then A ¢(p) = 0 for each p|no. Since p”||n, m must be 
even, for otherwise A f(n) = 0. Thus, no is a perfect square. In any case, n has the 
form ab? with a, b coprime and a bounded and A ; (b*) 4 0. Thus, we are motivated 
to study the Dirichlet series 


[o) 


Drs) = So INAS@)I>, 


n=1 


where the dash in the summation means we go over those n such that A ¢(n?) 4 0. 
Since A ¢(n7) is multiplicative, we may write this as an Euler product: 


cuam IT bs rEEC ON wear: 


m=0 


where the dash on the product indicates we go over primes p such that A ¢(p””) 4 0 
for any m > 0. Our objective is to determine a half-plane in which this series 
converges absolutely. 

We remark that if the series 


vz (a) 
Qa’ 


Me 


a=1 


converges absolutely for Ni(s) > c, then 


Devea) K Di veayer/nyr* Kx, 


n<x nsx 


for any € > 0. We will use this remark in our discussion below. 
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Let us note also that as 
INA ¢(n?))| <n d(n?), 


where d(n) denotes the number of divisors of 1, the series does not converge for 


‘ 1 
Ks) < d(k—1) 


Moreover, as D ; (s) is a Dirichlet series with non-negative coefficients, it must have 
a singularity at its abscissa of convergence, by a celebrated theorem of Landau. In 
particular, we have 


Svea) = OG"): 


a<x 


3 The special case of Ramanujan’s t-function 


For the sake of clarity, we will first consider a special case, namely, the study of the 
Dirichlet series 


adh 1 
Dy(s) = » a 


Since t(n7) is a multiplicative function, we can expand the series as an infinite 
product over the primes: 


Da(s)=]] » rr") 


Pp m=0 


Our goal is to determine a region of convergence for this series. By Proposition 4, 
we see that 


|c(p””)| > ai 


form => mo (say). This means that the series 


a letp2")| ho < x gy nee) 


m>mo m>mo 


converges for ‘t(s) > 0. To deal with the other part of the series, we need to estimate 
T( p?") for 2 < m < mo. We can use Propostion 5 combined with Roth’s theorem to 
derive a lower bound for |t(p””)| for 6 < m < mp. Indeed, Roth’s theorem allows 
us to deduce that 


leer Sp er. 
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We need to discuss lower bounds for t(p7) and t(p*). For this, we need to invoke 
the abc conjecture. To this end, let us define the radical of a natural number n, 
denoted by rad(7), to be the product of the distinct primes dividing n. The abc 
conjecture predicts that for any two coprime integers a, b, 


rad(ab(a + b)) > max(|a|, |b|)'~*, 


for any € > 0. The implied constant will depend on € but not on a, b. 


Lemma 7 Suppose that t(p) 4 0. The abc conjecture implies that for any € > 0, 
z(p*)| > per 


and 
[c(p*)| > pi. 


Proof. We first apply the abc conjecture to the equation 
t(p?) = t(p)’ — p". 
Suppose first that p is coprime to t(p). From the abc conjecture, we deduce that 
rad(t(p)°t(p?) p'') > p29. 
Using |t(p)| < 2p!'/?, we obtain 
|z(p?)| = rad(|r(p?)]) > pO, 


as desired. If p|t(p), write t(p) = p“v, with v, coprime to p. As t(p”) # 0, we 
deduce that 


rad(v’, p'74 (v7, _ py > Qo, 
so that 
t(p’) = pew, = pi) > | ala 


This completes the proof of the first part. For the second part, consider 


(2t(p)? — 3p!!)? = 4r(p*) — 5p”. 


Assuming first that p is coprime to t(p), we can apply the abc conjecture to this 
equation to deduce 


[ep > pee". 
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If p|t(p), then we write, as before, t(p) = p“v, with v, coprime to p. Then, we 
have 


4t(p*) = ed (C2 _ an 4 5p, 


Applying the abc conjecture to the term in the square brackets, we obtain 
|c(p*)| > gre , 


so that the result is proved in this case also. Oo 


We are now in a position to study the convergence of 


deen. 


m<mo 


We break the sum into three parts: 


F020) ied 2020) i Se 0 


3<m<mo 


By our earlier discussion, the last sum is bounded by p~**"*©), By the previous 
lemma, the first two terms are 


< po 28-O Rs) 


This result immediately implies that D,(s) converges for W(s) > 2/9. Thus, 


! 
va(a) <K 4 2/9tE 


a<x 


We record the following corollary for its own intrinsic interest. 


Corollary 8 [fa is an odd number, the number of solutions of t(n) = a is bounded 
by O(\a|?/°**), assuming the abc conjecture. 


4 The abc conjecture for number fields 


Let K be an algebraic number field. Suppose a,b,c € K* such thata+b+c=0 
Define 


radx (a,b,c) = I] Nxjo(P). 
p 
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where the product is over those prime ideals for which the numbers 


llallp» [lPllp. llellp 


are unequal. We will also write rad (a) to be the product of norms of the distinct 
prime ideal divisors of (a). We define 


Hx(a,b,c) = | [ max(\lally, |[blly, lelh), 


where the product is over all valuations of K (both finite and infinite and we 
normalize the archimedean valuations by ||x||, = |x|% with d, = 1 or 2 
according as v is real or complex, and the nonarchimedean valuations by ||x||, = 
Nx /o(p)~"). The abc conjecture for K is the following assertion. For any € > 0, 
there is a constant Cx. such that 


Hx(a,b,c) < Cx,<(radx (a, b,c))'**. 
A stronger version predicts that one may replace Cx. by 
C [K:Q] De : 


where Dx is the absolute value of the discriminant of K. We will not be using this 
stronger version of the abc conjecture in our discussion below. We refer the reader 
to Vojta [16] for further details. 

We first derive a consequence of the abc conjecture for number fields that will 
be applied in the subsequent discussion. 


Lemma 9 Let K be an algebraic number field and suppose that} = gcd ((a), (b)). 
Suppose for all finite primes , ||a||p A ||b||p Assuming the abc conjecture for K, 
we have 


rad(a)rad(b)rad(a + b)/(rad(@))? >> (max(|N(a)|, |N(b)|, [N(a + by) /N(0)?) 
where N stands for Nx/qQ and the implied constant depends on K and €. 
Proof. Suppose first that 0 = 1. From the definition, we have 


radx(a,b,a+b)= I] N(p), 
plab(a+b) 


since a,b, (a + b) are mutually coprime. Let us note that for every finite v, we also 
have that one of 


llallv. [2 lh. [la + lly, 
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is 1, so that 
Ax(a,b,a + b) = max(|N(a)|, |N(b)|, |N(a + 5))). 


The abc conjecture now implies the result in this case. If 0 # 1, let p be a prime 
ideal dividing 0. By our assumption, p enters into the radical. N(p) enters three times 
into the product rad(a)rad(b)rad(a + 6), and to remove two of the occurences, we 
can divide by N(p)*. This completes the proof. Oo 


In our estimations below, we will need a number field version of Lemma 6, and 
this we record here. 


Lemma 10 Let K be an algebraic number field and f a binary form in Ox|x, y] 
with no repeated factors. Then, assuming the abc conjecture for K, we have 


radx(f(u,v)) > Hx(u,v)o*, 


where d is the degree of f andu,v € K*. 
Proof. This is proved on page 105 of [2]. oO 


We remark that if we replace radx (f(u, v)) by | f(u, v)|, this is essentially Roth’s 
theorem for number fields. Thus, the abc conjecture is making a stronger assertion 
than that implied by Roth’s theorem. Indeed, since |N(f(u,v))| > radx(f(u, v)), 
we deduce the following: 


Corollary 11 Let K be an algebraic number field and f a binary form in Ox |x, y]. 
Then, 


IN(f(u, v))] > He (u, vy, 
where d is the degree of f and u,v € K*, assuming the abc conjecture for K. 
Lemma 12 Suppose that A ¢(p) 4 0. Assume the abc conjecture for K ¢. Then, 


IN(A ¢(p?))| > p**-9-€ 


and 
INA ¢(p*))| > pe, 
where d = [Kf : Q| and p is unramified in K ,. 


Proof. As before, we apply the abc conjecture to the equation 
Ap(p?) = Ag(py — pe. 


First suppose that A ¢(p) and p are coprime. By Lemma 9 applied to the field K ;, 
we obtain 


radx (A ¢(p)”, ps1, Ap (p?)) > pee, 
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where d = [Ky : Q]. We obtain 


p“|IN(A¢(p)) NA ¢(p2))| > phe, 


from which we deduce, using the Ramanujan bound |N(A ¢(p))| < 24 p@*—0?, 
that 

INA ¢(p*))| > pA? 
Now suppose that p“||(A ¢(p)), with a > 1. Then by taking norms, we obtain the 
inequality 

pit < ple-/2, 

implying a < (k — 1)/2. Since k is even, this is a strict inequality. Thus, a < 
(k — 1)/2. Since p is unramified, 


op = NQ@) F [lA ¢)? Ilp = ND). 
By Lemma 9, we obtain as before, 
[NA ¢(p?))| Se peer, 


The lower bound for |N(A ¢(p*))| is derived similarly. We apply the abc conjecture 
to the equation 


(2A ¢(p)? — 3p*!? = ACh) = Sp". 7 


5 The Dirichlet series D ; (s) 


We will now study the series Dy(s) and determine where it converges. Since 
N(A r(n?)) is multiplicative, we have the Euler product 


_ 1 
>s)=T] (> cca) | 


Pp m= 


Our goal is to determine the region where the Euler product converges absolutely. 
We split the product into two parts: p < co and p > co, for which we have that 
A ¢(p) is divisible by 2. The first product is finite and is over those p for which 
the A ¢(p”) are all coprime to 2. This product converges for }i(s) > 0. Let us 
now consider the other product. We proceed as in the case of the t-function. By 
Proposition 4, we see that for m > mo (say), 


[A ¢(p?")| > geen, 
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A similar estimate holds with f replaced by any conjugate form f%. Thus the series 
in the Euler product converges for i(s) > 0 if we restrict m > mo. By Corollary 
11, we have 


[pez Q@y 2 > pee 


for 6 < m < 2mo. Thus, 
Leia ee 
for 3 < m < mo. We deduce that 
INA p(p™))| > pe DA, 


for 3 < m < mo. To complete our estimates, we need lower bounds for |A ¢(p?)| 
and |A ¢(p*)|, which are provided by Lemma 12. From that lemma, we get that 


INA p(p*))| > PMP", INA p(p*))| > ph. 
Putting all this together shows the following: 


Theorem 13 Assume the abc conjecture for Kr. Letd = [Ky : Q|. Then, the 
Dirichlet series D ¢(s) converges absolutely for R(s) > 2/d(k — 3). In particular, 


A 
Ss vp(a) K x2/dk-3)te 


a<x 


for any € > 0, where the summation is over odd, positive a. 
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Multiplicity estimates, interpolation, 
and transcendence theory 


Michael Nakamaye 


In loving memory of my mentor and friend Serge Lang 


Abstract We discuss the problems of interpolation and multiplicity estimates on 
compactifications of commutative algebraic groups. We consider two extremal 
cases: one where multiplicity is imposed at a single point and the other where 
the conditions are imposed on an asymptotically growing set of points. Some 
conjectures and new results are given in both cases. 
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1 Introduction 


One of Serge Lang’s most memorable qualities was his incredible vision of 
mathematical structures. Whether studying diophantine approximation, rational 
points on algebraic varieties, or the far-reaching consequences of the heat kernel 
in number theory, Serge always had a simple, elegant vision of how, and often 
why, the mathematical universe behaves, what objects one might hope to find, and 
how to go about finding them. In this article we would like, to the best of our 
ability, to adopt a similarly broad point of view and study the global geometric 
properties of algebraic varieties which govern the behavior of multiplicity estimates 
and interpolation estimates on them. 

Suppose that G is a commutative algebraic group of dimension d defined over 
the complex numbers. Let X be an equivariant compactification of G and A an 
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ample line bundle on X. If I” C G isa finite set, then we let 
T(S) = {x +... + xs : x; € I foralli}. 


For simplicity, we will assume that I” contains the identity eg of G. Interpolation 
asks to find the maximal order jets which a multiple of A can simultaneously 
generate at the points of \(S). Multiplicity estimates, on the other hand, seek the 
maximal order to which a non-zero section of a multiple of A can vanish at the set 
of points J*(S). We will consider only the two extreme cases, where ” = eg and 
where I” contains non-torsion elements of G and S is large. 

We consider first the case where I” is reduced to a point. In this case, the fact 
that X is an equivariant compactification of a group variety is irrelevant, except 
insofar as the open subset G C X is a homogeneous space for the action of G, and 
consequently eg € G C X may be viewed as a very general point. Suppose then 
that X is a smooth projective variety and 7 € X is a general point. We now introduce 
formally the two numerical invariants of the pair (X, A) which will be studied. 


Definition 1.1.1. Suppose X is a smooth projective variety, x € X, and A an ample 
line bundle on X. Then 


Gia oO, 
csx mult, (C) 


here the infimum runs over all integral curves C C X passing through x. 


The invariant €(x, A) is called the Seshadri constant of A at x. The reader is 
encouraged to consult [L], Volume I, Chapter 5 for the basic properties of Seshadri 
constants, the geometric information they encode, as well as some very interesting 
problems, both solved and unsolved, regarding Seshadri constants. This numerical 
definition is equivalent (see [L] Theorem 5.1.17 for details) to a more intuitive 
geometric definition. In particular, e€(x, A) is the supremum of all non-negative 
rational numbers @ such that the linear series |n A| separates na-jets at x for n 
sufficiently large and divisible. Any irreducible subvariety V C X, of positive 
dimension and maximal with respect to the relation of inclusion, satisfying 


deg,(V) am 
(se) =e) 


is called Seshadri exceptional at x relative to A. An important result of Campana 
and Peternell [CP] asserts that a Seshadri exceptional subvariety always exists. Note 
finally that solving the interpolation problem for X and A at a point 7 is equivalent 
to bounding the Seshadri constant €(7, A) from below. 

Next, for an ample divisor A on X we let 


m(A) = sup {mult,(D) | DE Div(X)@Q effective} : 
D=A 
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here = denotes numerical equivalence and 7 is a very general point of X. To see 
that the invariant m(A) is well-defined, choose a very ample divisor B. For any 
Q-divisor D = A, we can choose general divisors B,,..., Bg—; from the linear 
system |B|, containing 7, so that the intersection 


support(D)N By N---N Bg-} 
is proper. It follows that 
mult,(D) < (A) Ne (B)*1, 


establishing that the supremum exists in the definition of m(A). 
The two invariants m(A) and €(n, A) are closely related. Indeed, as we will 
establish at the end of §1.2 


e(n, A) < Vdeg4(X) < m(A) (1) 


with equality holding on the one side if and only if it holds on the other. More 
generally, the smaller €(7, A) is the more sparse will be the jets generated by |k A| 
at 7, and consequently the larger m(A) will be. We can quantify this relationship 
precisely on a surface. 


Theorem 1.1.2. Suppose X is a smooth surface and n € X a very general point. 
Then for any ample line bundle A on X we have 


seea(%) < m(A)e(n, A) < deg 4(X). 


For higher dimensional varieties, the analogue of Theorem 1.1.2 requires the 
introduction of new invariants. 


Definition 1.1.3. For 1 <i < d let 


a;(A) = sup {a €Q: dim (Bs kA ® mia 


ja for k > ok; 


here BS denotes the base locus and m, C Oy the maximal ideal associated to the 
point 7. 

Note that a@|(A) = €(y, A) and ag(A) = m(A). As in Theorem 1.1.2 one can 
establish the upper bound 


d 
| [ai(4) < deg y(X). 


i=1 


A lower bound is harder to come by, however, since the proof of Theorem 1.1.2 
uses in a fundamental way in order to establish (4) below the fact that the Seshadri 


exceptional curve C;, is a divisor. Along the lines of Theorem 1.1.2, we make the 
following conjecture: 
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Conjecture 1.1.4. Suppose A is an ample line bundle on a smooth projective 
variety X of dimension d. Then 


Te (A) > » Seat) 


i=1 


As in the case of surfaces, Conjecture 1.1.4 is the best possible result. We will see 
that Conjecture 1.1.4 is true, as in the surface case, if there is a divisorial Seshadri 
exceptional subvariety of A at 7. The interest of Conjecture 1.1.4 is that it gives a 
connection between m(A) and €(n, A). In particular, Conjecture 1.1.4 says that if 
m(A) is small, then €(7, A) cannot be too small. Thus any upper bound on m(A) 
entails a lower bound on €(n, A) and, conversely, if €(7, A) is small, then m(A) 
cannot be too small. 


We will now consider non-zero values of the parameter S. Here the most refined 
results are due to Philippon [P1,P2]. From the geometric viewpoint, the one manner 
in which Philippon’s zero estimates might be strengthened is to distinguish the 
cases where S' is small from those where S is large. Indeed, these two cases are 
qualitatively very different. When S = 1, since no hypothesis is made concerning 
the finite set 7 C G, no meaningful conclusion can be drawn without imposing 
vanishing along I(r) for r > 1. For example, if G = A? and I consists of 
1,000,000 points on a smooth conic C C G, then of course there is a regular 
function of degree two, namely the function defining the conic C, which vanishes 
to order one along all 1,000,000 points. Meanwhile, if the 1,000,000 points were 
general points of G, there would be no polynomial of degree smaller than roughly 
2,000,000 vanishing at each of the points. 

Thus the main goal, in this case, of Philippon’s construction is to replace I” with 
I’(2), thus allowing the group law on A? to spread the points on C around. If, 
however, as may sometimes be assumed in applications, the set along which one 
imposes vanishing is already of the form [°(S) for S > 1, then a much stronger 
result can be expected to hold since regardless of the position of the points of I’, 
they have already been distributed by the group law on G. 

In order to evaluate how well the points of I”(S') will be distributed inside G, we 
will use a line of reasoning which grows out of the work of Faltings and Wiistholz 
[FW] on the Schmidt subspace theorem. 


Definition 1.1.5. Suppose H C G is a non-trivial subgroup. Let I"(oo) be the 
subgroup of G generated by I” and set 


p(#7) = rank(I"(oo) N A). 
Setting d(H) = dim(/7) we also define 


p(H) 


vd, H) = d(H) 
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A subgroup H is called the ’-exceptional subgroup if v(I’, H) > v(I, A’) for all 
non-trivial subgroups H’ C G and if H is maximal among such subgroups. 


The same notion is defined, with the notation *(I, H), and used in a similar 
fashion by Masser [M] and, not suprisingly given the relationship with [FW], it 
is closely related to a second invariant used by Masser and Wiistholz [MW] and 
Waldschmidt [Wal] to study related diophantine problems. In §2 we will show that a 
I-exceptional subgroup H always exists and, moreover, that there is an increasing 
sequence of subgroups 


A, C::-C AH, 


with H, the [’-exceptional subgroup, H, = G, and v(I, H,) >... > v(I, A,). 
Moreover if H C G is a subgroup with v(H) = v(H;) then, H C Hj. 
In order to study the generation of jets along ’(S) when S is large, we need 
a notion of Seshadri exceptional subvariety for the set of points [(S). First, the 
Seshadri constant of A along a finite subset Z C X is defined by 
€(Z, A) = int = ee 
C (> ez mult, (C) 


where the infimum is taken over all irreducible curves C C X containing at least 
one point of Z. An irreducible subvariety V C X of positive dimension is called 
Seshadri exceptional for A relative to Z if 


(= ec 


yez mult, (V) 


and if V is maximal with respect to the relation of inclusion. 


Theorem 1.1.6. Suppose the I’-exceptional subgroup H C G is proper. Then for 
all S sufficiently large, a translate of each Seshadri exceptional subvariety of A 
relative to I'(S) is contained in H. 


Theorem 1.1.6 is nearly optimal but once S is sufficiently large one might ask 
whether or not there are criteria for when the subgroup H is the Seshadri exceptional 
subvariety: 


Question 1.1.7. With hypotheses as in Theorem 1.1.6, is the subgroup H_ the 
Seshadri exceptional subvariety for A relative to '(S) for all S sufficiently large? 


Quantitatively, an affirmative answer to Question 1.1.7 leads to an improvement 
of interpolation estimates involving factors of dim(G) and the rank of J”, as we 
will see in Theorem 1.5.1 below. Qualitatively, however, an affirmative answer to 
Question 1.1.7 is much stronger than Theorem 1.5.1 as it identifies the obstruction 
to the interpolation problem in terms of the subgroups of G. 
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We are left to address the interpolation question in the case where 
v(I,G) > v7, A) 
for all subgroups H C G. In this case, there are no obstruction subgroups and thus 


the points of [°(S) are well distributed with respect to subgroups of G. 


Conjecture 1.1.8. Suppose X is the compactification of a commutative algebraic 
group G and I" C G is not contained in Gtors. Suppose G is I -exceptional. Then 
for all S sufficiently large 


a/dega(X) 
IP(S)| 


e(M(S), A) >= 


More generally, it is reasonable to look for the best possible lower bound for 


tia e("(S), A) VIP(S)| 
im inf : 
Soo i/deg ,(X) 


and attempt to characterize when the limit is close to 1. If the limit were always 1, 
then Question 1.1.7 would have an affirmative answer, in asymptotic form at least, 
without the hypothesis that the ["-exceptional subgroup H is a proper subgroup 
of G: in other words, as S becomes larger and larger the Seshadri constant for A 
relative to I(S) is closer and closer to maximal and thus X itself is becoming closer 
and closer to being Seshadri exceptional. 


Finally we would like to address zero estimates when the parameter S is large. 
Leth; = dim(H;) for 1 <i <r. As in Definition 1.1.3 we also introduce a constant 


to measure the dimension of the base locus of H® (kA @ Tke 


ri ) for different values 
ofa:forl<i<r 


a(S, A) = sup {a > 0: dim (Bs (4% ® Tits,)) <h; for some k > 0} : 


Only the dimensions of the subgroups H; are considered in the definition of 
a;(S, A) since, conjecturally, when S >> 0 only translates of these subgroups will 
appear as obstructions: 


Conjecture 1.1.9. Suppose X is the compactification of a commutative group 
variety and I” is not contained in Gis. Suppose G is not I’-exceptional so that 
r > 2. Then for 1 < j <r—land1 > «> 0 there exists S sufficiently large so 


that 
K(a ;4.4(S,A)—e) 
BS (kA @ Ip" ) 
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consists of translates of Hy, where 0 <k < j. Moreover, given0 < B <1, for each 
1< j <r-—JlandforallS sufficiently large, 


(a@j410S, A) — a; (S, A))°icard('(|S*]) ++ H;/H;) deg ,(H;) < deg ,(X). 


where c; = codim(H;, G) and | $8 | is the greatest integer less than or equal to S?. 


Both parts of Conjecture 1.1.9 can be cleaned up and strengthened by enlarging 
the set [°(S) and this will be examined in §2. As stated, the degree inequality 
of Conjecture 1.1.9 would be an improvement of joint work with Ratazzi [NR]. 
The arguments of the article [NR] prove the best known results in the direction of 
Conjecture 1.1.9. 


The organization of the paper is as follows. In §1 we prove Theorem 1.1.2 and 
its analogue in the higher dimensional case when there is a divisorial Seshadri 
exceptional subvariety for A at 7. In §2 we discuss the basic properties of the 
invariant v(I’, H) and the filtration which it induces on G. We then study one 
example of this filtration carefully and relate this to Conjecture 1.1.9. In §3 we 
prove Theorem 1.1.6 while §4 is devoted to a discussion of Conjecture 1.1.8 and 
Conjecture 1.1.9. In particular, Conjecture 1.1.8 can be established up to explicit 
factors involving dim(G) and p(G) and similarly for Conjecture 1.1.9, except that 
we cannot necessarily isolate each subgroup H; and thus the inequality obtained 
involves an unknown subgroup containing H;. Due to space constraints, we can 
only sketch the main ideas of the arguments in some cases but we have tried to 
include all important ideas. One aspect of this story which we do not have time to 
address here but which, particularly for applications in transcendence theory, merits 
serious attention is the issue of giving effective bounds, as a function of S, for zero 
estimates and interpolation estimates. 


Acknowledgements As is clear from the testimony in §5, my greatest mathematical debt is to 
Serge Lang and it is with great affection that I dedicate this work to his memory. Others to whom I 
am endebted for many discussions involving the material of this paper are D. Bertrand, S. Fischler, 
P. Philippon, and N. Ratazzi. 


1.2 Surfaces 


Proof of Theorem 1.1.2. 
One of the inequalities of Theorem 1.1.2 is trivial, namely 


m(A)e(n, A) < deg ,(X). 


Indeed, choose an effective Q-divisor D = A with multiplicity at least m(A) — 6 
at 7. Choose a general effective Q-divisor E = A with multiplicity at least 
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€(n, A)—6é at 7. By definition of e(7, A) we may assume that D and E meet properly 
at 7 and, by Bézout’s theorem, 


(e(n, A) — 8)(m(A) — 8) < deg ,(X). 


Allowing 6 to approach zero establishes the upper bound on m(A)e(y, A). 

For the lower bound we use the counting methods of [N2] and a key observation 
of [EKL] also used in [N2]. Suppose that C, is a Seshadri exceptional curve for 
A at n: if the Seshadri exceptional subvariety were X itself, then we would be in 
the situation where €(n, A) = m(A) = ap deg ,(X) and there would be nothing to 
prove. 


Suppose D € kA ® mia. Then D vanishes along C;, if a > e€(y, A). In fact, 


using [N2] Lemma 1.3, which is in turn a variant of [EKL] Proposition 2.3, we have 


multe, (D) = ka —ke(n, A) — 1. (2) 
We now define a sequence of vector spaces as follows. For each integer a > 0 let 
Va = H°(X,kA @ mt /mi*"), 
We have the trivial upper bound 
dim(V,) <a +1, 1<a< [ke(yn, A) +1]. (3) 
On the other hand, for a > [ke(7n, A) + 1] we can apply (2) to conclude that 
multc,(D) = a — [ke(n, A) + 1]. 


In a worst case scenario, from the point of view of the counting, C;, is smooth at 7. 
In this case, writing 


D = (a— [ke(n, A) + 1])C + D’, 
and then removing the multiple of C gives an inclusion 
Va CH (X,kA = (a = [ke(n, A) + 11) C @ m[be +N Jpn leet 21) | 
In particular, for alla > [ke(n, A) + 1], 
dim(V,) < ke(n, A) + 2. (4) 


We have 


no (x, kA® mi) = h(X, kA) — > no (x, kA® m',/mi,*!) 


UT] 
i=0 
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Applying (3) for each 1 <i < [ke(n, A) + 1], and (4) for eachi > [ke(y, A) +2] 
we find, up to factors of O(k), 


2 
sate — (ka — ke(n, A))ke(n, A) 


no (x, kA® mi) = h°(X, kA) — 
ke(n, A)? 


= h°(X,kA) — k?ae(n, A) + 5 


By Riemann-Roch h°(X, kA) = ate FES) + O(k). In particular, for k >> 0 we see 
that h° (x, kA® mi) > 0, and consequently m(A) > a, as long as 


e(n, A)” _ deg ,(X) 
ae(n, A) — = < = , 


Thus we conclude that 


deg ,(X) 4 €(n, A) 
2e(n, A) 2 © 


m(A) = 


This concludes the proof of Theorem 1.1.2. Note that when e€(7, A) is small the 
lower bound becomes sharper. 

Note that Theorem 1.1.2 is the best possible result of this type. Indeed, the right- 
hand side can be an equality as we see by taking X¥ = P? and A = Op2(1). On the 
other hand the left-hand side becomes closer and closer to being an equality with 
X = P! x P! and A = O(a,b) witha < b. Indeed, in this case m(A) = a + b 
and €(y, A) = a with Seshadri exceptional subvariety being P! x m2(n). We have 
deg ,(X) = 2ab and thus oe = ab is only slightly less than m(A)e(n, A) = 
ab +a. According to [N1] this scenario only arises when the Seshadri exceptional 
curve is a fibre of a surjective map mz : X — Y where Y is acurve. 

We now discuss the analogue of Theorem 1.1.2 for the case where dim(X) > 2 
and where there is a Seshadri exceptional divisor D, of A relative to (S) at 7. 
In this case we have direct analogues of the three key estimates (2), (3), and (4). 


Suppose D € H® (x, A® mie), Then we have as before 
multp,(D) = ka —ke(n, A) — 1. (5) 
With V, defined precisely as in the surface case we have 


a+ dim(X) — 1 


ne ( dim(X) — 1 


). 1a <[he(n.a) +11. (6) 


Lastly, fora > [ke(n, A) + 1] we have 


(7) 


dim(Vi) < aa A) + 1] + dim(X) — ') . 


dim(X) — 1 
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Combining (6) and (7) we find, up to O(k¢~'), 


k4 deg4(X) k(n, A)4 k4e(n, A)?! 
= =(ko=ke(y, Ay — = 
= 7 rT (ka—ke(n, A)) (@—1)! 


h® (x, ka® mi) 


Arguing, as in the surface case, we deduce that 


deg ,(X) d-1 
A) > ————_ , A) —_. 8 
m(A) > Fo pat +l. IG (8) 
Since the Seshadri exceptional subvariety D,, is a divisor, we have a;(A) = €(n, A) 
for 1 <i <d —1 and thus (8) implies that 


d 


[Testa) = eon. ayimcay 2 SSS + nay (F). 


i=l 

To conclude this section, we prove (1) from the introduction, generalizing the 
argument given on surfaces in the first paragraph of this section. The fact that 
m(A) => €(n, A) is clear as m(A) measures the maximum asymptotic multiplicity 


of a section of KA at n, while «(7, A) measures the maximum asymptotic separation 
of jets of kA at 7. Moreover, [L] Volume I, Proposition 5.1.9 asserts that 


e(n, A) < deg ,(X). (9) 


The fact that m(A) > ¢/deg,(X) is a simple counting argument using the Riemann- 
Roch theorem on X. We will show that 


e(n, A)*~! -m(A) < deg 4(X). (10) 


Combining (9) and (10) shows that if «(7,A) = </deg,(X), then m(A) 


4/deg ,(X). The opposite implication, that if m(A) = </deg,(X), then €(7, A) = 
4/deg ,(X) is more subtle and is the content of the last section of [N2]. As for (10), 
given 6 > 0, choose k > 0 and sections 5;,...,5¢ € H°(X,kA) with the following 
properties: 


(a.) mult,(s;) = (e(7, 4) —46)k, 1<i<d—I, 
(b.) mult, (sa) > (m(A) — 4)k, 
(c.) the tangent cones of the sections s; at 7 meet properly. 


II 


Letting D; be the zeroes of s;, by (c.) the intersection D; M--- Dg is proper and by 
(a) and (b) the intersection multiplicity at 7 will be at least 


(e(n, A) — 8)! (m(A) — 8) k 4. 


Since this intersection multiplicity is at most the total degree of the intersection, 
k4 deg ,(X), (10) follows taking the limit as 5 approaches zero. 
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In order to justify why, in the language of Definition 1.1.5, there is a ’-exceptional 
subgroup H C G, suppose H;,H, C G are two distinct subgroups with 
v(I, H,) = v(Z, M2) both maximal. We may assume that neither subgroup contains 
the other. It is sufficient then to show that 


v(, A, + Ao) = v(T, Ai) = vd, Ad). (11) 
If A, M A = {0}, then d(A, + Hd) = d(H) + d(H) and p(y, + Ad) = 
p( A) + p(A2) and thus (11) is true. Thus we may assume that H; N A) is non- 
trivial. 
Since v(I, H,) and v(I, Hz) are maximal we have 
vl, Ay t+ A) < vd, A). 
If vd, A, + Ao) < vW’, Aj), then we claim that 


p(H, N Ap) . p(A)) 


; (12) 
d(—#,0 Hz) d(M) 
contradicting the maximality assumption on H. Note that 
A, + #H. Ai) + p(A2) — p( A, ON 
WT Hy + ey) = p(y; 2) 7 P(A) + p(A2) — pCi 2) (13) 
d(H, + Hx)  d(A\) + d( Hz) — d(H, 9 Ap) 
Ifa, b,c,d are positive numbers with a > c and b > d, then 
a-c a. a Cc 
< — ifandonlyif — < (14) 


b—-d b bd’ 


Combining (14) with (13) establishes (12) and concludes the proof that there is a 
unique maximal subgroup H C G with v(#7) maximal. 

Let H, be the J"-exceptional subgroup of G. If H; = G, then we are done. If not, 
let v2 be the largest value, strictly smaller than v(I, H;), such that v(I, H) = v2 
for some subgroup H C G. Among all subgroups, choose Hz maximal so that 
v(I’, Hz) = v2. As above we consider H, + Hp. If Hy N Hy = {0}; then v( Aj) > 
v(H, + Hz) > v( Ad), contradicting the choice of Hy. Thus H, M Az is non-trivial. 
By choice of H; we have v(H, NM Az) < v( Aj). If v(A, A Az) = v(A)), then (13) 
gives the contradiction 


v(A) > vA ON Az) > vA + A) 


unless H, 1 Hy = Hj. But if H;) ON Hy = Aj, then we are done since this implies 
that H, C Hp». Thus we may assume that v(H; N Hz) < v(H;) and so, by the 
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choice of Hy, v(H, N Az) < v(A2). Using (13) again will show that 
v(A, + A) > v(Ad), (15) 


unless H; 1 Hy = A. In either case we are finished since (15) contradicts the 
choice of H>. 

If H, = G we are done. If not, choose the largest value v3 < v2 so that there is a 
subgroup H3 C G with v(H3) = v3. We choose H3 maximal and arguing precisely 
as in the previous paragraph shows that H2 C H3. The chain terminates when we 
find H, = G. 

In the rest of this section, we would like, in order to motivate Conjecture 1.1.9, to 
consider a special case of G and I” where we can compute base loci explicitly. Let 
G = Ab and X = P*. Let vy; = (1, 0,0), v2 = (0, 1,0), and v3 = (0,0, 1). Choose 


I = {0,411 V1, ---. Qtr V1, 421V2, --- Aap. V2, 431V3,-.-  373V3}, 


where the aj; are algebraically independent over Q and r; >> rz > 73. In this 
scenario we have 

H, = A! x {0} x {0}, 

Ab = A! x A! x {0}, 

H; = A'xA!x Al, 


We also have 


vd, A) =n, 


v(P, Hp) = aaa 


re +2 +73 


vl, H3) = 3 


Finally, let 


I = {0,a11v1,...,@1r Vi}, 
Ty = {0, do1v2,..., A2y,V2}, 


T3 => {0, ATV 3s: 54 5 3743}. 
For this example it is useful to introduce a slightly larger set than "(S), namely 


T(S) = {ayx, +... tax, :0< a; < S foralli, x; € I}. 
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Conjectures 1.1.8 and 1.1.9 should still hold with '(S) in place of (S). Consider, 
for S > 0 fixed and k large, the linear series 


Ox(k) @ IK.) (16) 


Suppose a < (S + 1)~". We claim that the base locus of (16) is ’(S}). Choose 
coordinates x,, 2,3 on A? where x; is the coordinate from the i factor A!. Let 
mt; : A> — A be the projection to the i" factor. Note that 2;(7(S)) = I;(S). Let 
fi(x;) be a polynomial of degree k which vanishes along I";(S) to order ka. In 
order to verify that such a non-zero polynomial f;(x;) exists, note that |I7;(S)| = 
(S + 1)". The cost of vanishing to order at least ka at a point is [ka]. On the other 
hand, polynomials of degree at most k in x; form a vector space of dimension k + 1. 
Thus as long as 

fka](S +1)" <k+1 


a non-zero polynomial /;(x;) exists satisfying the required vanishing conditions. 
Using the hypothesis that r) >> rz >> r3 and looking at the common zeroes of the 
polynomials f)(x1), f2(x2), and f3(x3) shows that the base locus of (16) is exactly 
T(S) whena < (S +1). 

When a > (S'+ 1)~"! we claim that A! x { P)} x { P3} is in the base locus of (16) 
for all Py € ['2(S) and P3 € 13(S). Indeed, if o €¢ H°(X,Ox(k)) vanishes to 
order at least ka along "(S), then o|A! x { P2} x { P3} is a polynomial of degree at 
most & with order of vanishing larger than k(S + 1)~"! at (S + 1)" points, and this 
is impossible unless o|A! x { P2} x {P3} = 0. Using the functions fo(x2), f(x3) 
defined above, we see that when (S + 1)~"2 > a > (S + 1)~” the base locus of 
(16) is precisely 


e A! x {Po} x {P3}, 


Px€T2(S),P3€T3(S) 


where A! x { P3} x { P3} denotes the Zariski closure of A! x {P2} x {P3} in P?. 

Suppose now that wa > (S + 1)~”?. We claim that for P; € ”3(S) the subvariety 
A! x A! x {P3} is in the base locus of (16) for each P; € ”3(S). Suppose f is a 
polynomial of degree k vanishing to order at least [ka] at '(S). Suppose that f 
does not vanish identically along A! x A! x {P3} and let 


g = f\|Alx A! x {P3}. 
We claim that for each P; € 7, (S) g vanishes at { P,} x A! x { P3}. Indeed, if not, 


then g|{P,} x A! x {P3} has degree k and order of vanishing at least a at (S + 1)” 
points, and this is impossible. Thus g is divisible by 


h(x) => I] (x1 =— P,). 


Py el\(S) 


488 M. Nakamaye 


Consider the quotient g(x;, x2)/h(x,). This polynomial has degree k — |I"\(S)| 
and order of vanishing at least [ka] — 1 at (x, y) for x € '\(S) and y € I2(S). 
Note that the argument applied to establish that g is divisible by h applies more 
generally. Indeed, if r(x;, x2) is any polynomial with order of vanishing at least ¢ 
along ”\(S) x I"2(S), with t|I"7(S)| > deg(r), then r vanishes at { P,} x A! for 
each P, € I'\(S). Dividing g(x;, x2) by h(x) drops the degree of g by |I\(S)|, 
while the order of vanishing along ",(S) x I”7(S) has gone down by 1. Thus we 
conclude, setting r(x1,X%2) = g(X1, X2)/h(x;), that r(x, x2) is also divisible by 
h(x,) and the argument can then be repeated until we reach a contradiction. We 
conclude that fora > (S + 1)~” the base locus of (16) contains 


|) Alx Al x {P3}. 
P3€T3(S) 


Using the function /3(x3) defined above, we see that the base locus of (16) for 
(S +1)? <a < (S +1) is precisely 


[J Alx Al x {P3}. 
P3€T3(S) 


Suppose now that a > (S + 1) and that f is a polynomial of degree k 
vanishing at ["(S) to order at least ka. Using the argument of the previous case 
we will establish that f is divisible by the polynomial 


h(x) = [[ Gi- Pi). 


Piel (S) 


For P; € I'\(S) consider f|{P,} x A! x A!. This has order of vanishing at least 
a at '(S) x I3(S) and so, repeating the argument of the previous paragraph, 
f vanishes identically on {P;} x A! x A!. We then repeat the same argument for 
f(x)/h(x1) which has degree k — |I";(S)| and order of vanishing at least [ka] — 1 
along I"(S). We eventually reach a contradiction, concluding that f does not exist. 

In this particular case it is easy to verify that Conjecture 1.1.9 holds with (8) in 
place of I"(S'). Indeed, we have provided explicit polynomials, for any € > 0, with 
the specified base locus. As indicated after the statement of Conjecture 1.1.9 the 
base locus of the linear series (16) is exactly translates of H;, where j will depend 
ona. 

What happens in this example for Conjecture 1.1.9 when J"(S') is used instead 
of ’(S)? Qualitatively we will find the same result since we have the inclusions 


F(S) c F(S) c rds), 
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but the quantitative sharpness of the estimate will be lost. To better understand 
what goes wrong, we will consider the same example but with [(S) in place of 
the enlarged "(S). 

When a > (S + 1)~" the subvariety A! x {0} x {0} will enter the base locus 
precisely as before. For other values of P,P; € I(S), however, the curve A! x 
{ P2} x { P3} enters the base locus at a later point. For example, if P) = 0 and P3 = x 
for some x € I” the argument above will only establish that A! x { P2} x { P3} enters 
the base locus once a > S~"3. Thus as @ grows, more and more curves of the form 
A! x{P)}x{P3} will enter the base locus. Eventually, the surface A! x A! x {0} enters 
the base locus quickly to be followed by fibres over other points P; € I"3(S). The 
values where the base locus jumps in dimension will be comparable asymptotically 
in S, but the structure is much more involved and does not lend itself to a simple 
statement as in Conjecture 1.1.9. 


1.4 The J'-exceptional subgroup 


In this section we will prove Theorem 1.1.6. Note first that for S large, the existence 
of a proper J"-exceptional subgroup H forces the Seshadri constant e€(J"(S), A) to 
be highly submaximal. Indeed, for S' large, using the subgroup H to estimate the 
value of e(1"(S), A), we have by [L] Volume I, Proposition 5.1.9 


deg(H) yr 


e(P(S), A) < Gee an 


Recalling that py = rank(I"(0oo)N A), the right-hand side grows like O (s ~ amt ) ; 
while the maximal value possible for «(1"(S), A) is 


1 
ee] dim(G) _ PG 
= O S dim(G) . 
( [P(S)| ( ) 


Since H is I’-exceptional and is a proper subgroup of G we have 
and consequently for S large 


_PH _ PE __ 
dim(H) ~ dim(G) 


1 
deg(G) \ =@ 
c(r(5).A) « (FEO a7) 
We claim that for k sufficiently large the linear series 
dke(I'(S),A)+d 
kA @ Lr(as)—rias) (18) 
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is non-empty. Note that 7 — I” generates the same subgroup of G as I” and in 
particular |"(S) — (S)| and |°(S)| both grow like polynomials in S of degree 
pq. Thus, according to (17), for k, S >> 0 the linear series (18) is non-empty. 


We will also need to study the growth of |"(S) — ’(S)| on subgroups. Suppose 
HC Gis an irreducible non-trivial subgroup. Choose a basis x;,..., Xp, for the 
free part of I’(co). Choose r > 0 so that each non-torsion element x € I” can be 


expressed as - 
x=te+ > ax, 
i=] 
where ty € I (00)tors and |a;| < r. Let 


2G 
A(S) = | dose : lai| < s| : 


i=1 


For all S > Oif x € I(S), then there is an element y € I"(00)tor3 and z € A(S) so 
that 
x=ytZ. 


Thus we have 
|H NT(S)| < |P(oo)tors|| A CS) N P'(0o) free | 
= [1° (00)tors || nN A(rS)| 


Let px be the rank of H. Then |H M A(rS)| < (2rS + 1)?#. In particular it is 
possible to choose S' sufficiently large so that the linear series (18) is non-empty 
restricted to any subgroup H which satisfies v(H) < v(G). 


Proof of Theorem 1.1.6. Suppose that V is Seshadri exceptional for A relative to 
I(S) for some S > 0. Thus by definition 


19 
yer) mult, (V) oe) 


Let o be a non-zero section of the linear series (18). We will outline a proof that 


rae ( deg ,(V) 


o vanishes on t_,(V) +... +¢-.(V) C Z(o) with d summands (20) 


for each x € I(S). We will review here the argument to establish (20) for one and 
two summands and indicate how it extends to the general case. 

We first claim that for g € G if V is Seshadri exceptional for A relative to (S) 
then V + g is Seshadri exceptional for A relative to '(S) + g. Note that for all 
positive dimensional W c X 


deg4(g + W) _ ___deg,(W) 
Vyer(sy Multgyy(g +W) deers) multy(W) 


(21) 
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because the degree is translation invariant. As W varies over all curves, the infimum 
of the right-hand side is e("($), A) by definition, while the infimum of the left hand 
side is e(g + I(S), A) and thus 


e(I'(S), A) = e(g + F(S), A). 


Using (19) and (21) we conclude that V + g is Seshadri exceptional for A relative 
to (S) + g as claimed. 

Suppose now that x € I"(S'). Since V—x is Seshadri exceptional for A relative to 
I'(S) —x, and since o vanishes to order > ke(I"(S), A) at [(S) — F(S) it follows 
from [NR] Proposition 3.2 that o vanishes on V —x for each x € I’(S), establishing 
(20) for one summand. To establish (20) for two summands, note that for all x, y € 
I'(S), y + V — 2x is Seshadri exceptional for A relative to y + (S) — 2x. Since o 
vanishes to order > 2ke(I"(S), A) +2 along y + (SS) —2x we deduce that o must 
vanish along y + V — 2x. Using the differentiation argument of [NR] Proposition 
3.2 shows that in fact o has order of vanishing at least ke(I"(S), A) + 1 along 
I'(S)+V —2x. In particular, o vanishes along V + V —2x as desired. This process 
can be iterated, using the fact that o vanishes to order at least kde(I"(S), A) + d 
along (dS) — (dS), to establish (20). 

Choose x € '(S)NV: we know such an x exists since V is Seshadri exceptional 
for A relative to '(S). Let W = t_,(V) so that W contains eg andletW+...+W, 
with d summands, be the Zariski closure in ¥ of (WNG)+...+(WMG). Then 
from the previous paragraph we conclude that o vanishes on W + ... + W. Since 
o + (it follows that W N G is degenerate. Let Hw C G be the smallest subgroup 
containing W. Then V C x + W C x + Hy is still Seshadri exceptional for 
A|(x + Hy) relative to [(S) M x + Hy. Thus V — x is Seshadri exceptional 
relative to [(S) — x for A|H yw. If v(Hw) < v(H) then, according to (18), for S 
sufficiently large, but not depending on W, the linear series 

kAlHw ® Zreasycrias) 
is non-empty. Repeating the above argument will show that V — x is degenerate 
inside H w which is impossible by definition of Hw. Thus v(Hy) = v(#), and so 
we have Hw C H. This concludes the proof of Theorem 1.1.6 since we have shown 
that V —x C Hy. 

Note that if dim(G) = 2 then Theorem 1.1.6 implies that for all S' sufficiently 
large a translate of the [’-exceptional subgroup is Seshadri exceptional. This 
generalizes [NR] Théoréme 1.5. 


1.5. What can be shown 


The obstruction to the interpolation problem is the subgroup H where v(I, H) 
is maximal or in other words where I"(0o) MN H is most densely distributed. For 
multiplicity estimates, on the other hand, there are in general several possible 
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choices for an obstruction subgroup, namely H;,1 <i < r —1. The best choice 
among the possible obstruction subgroups, from the point of view of the inequality 
in Conjecture 1.1.9, is the subgroup H; for which a;+; — a; is largest. Stated 
qualitatively, the best choice for the obstruction to multiplicity estimates is the 
subgroup K in which I"(co) N K is most densely distributed by comparison to 
I'(co) N K’ for all subgroups K’ properly containing K. 

Here we would like to discuss what can be shown in the direction of Question 
1.1.7 and Conjectures 1.1.8 and 1.1.9. First, regarding Question 1.1.7, Theorem 
1.1.6 is, at least qualitatively, already a large step in the right direction, for even 
if the [’-exceptional subgroup H is not itself Seshadri exceptional relative to (S) 
for large S, it remains the source of the interpolation problem and provides a good 
lower bound for the interpolation exponent. Concerning Conjecture 1.1.8 we have 


Theorem 1.5.1. Suppose G is a commutative algebraic group with compactifica- 
tion X and A an ample line bundle on X. Let [ C G bea finite set and let H be 
the I’-exceptional subgroup and dy its dimension. For all S sufficiently large, 


1 { degy(H) \# 
aS a (ee) 


zr 


Similarly we have 


= 1 deg,(H) \t% 
as Gas ' a, : 


The penalty to be paid for not knowing that H is Seshadri exceptional is that 
|"(S)| must be replaced with | "(dz S)|. Otherwise this would be Conjecture 1.1.8. 


Proof of Theorem 1.5.1. Let V be the Seshadri exceptional subvariety for A relative 

to '(S) for some S > 0. The proof of Theorem 1.1.6 shows that for some 

x € I'(S) we have V Cc x + H’ where H’ C A and v(A’,T') = v(A,T). 

We assume that H’ is the smallest subgroup containing a translate of V. We 

know that V is Seshadri exceptional for A relative to /\(S) and in particular V 

is still Seshadri exceptional for A|x + H’ relative to "'(S) 9 A’. If e((S), A) < 
1 


deg 4(H aq : . : : 
x (Se) " then the same inequality holds with dy in place of dy: 


1 deg,(H) \ ar 
8). <7 (Rae am) : — 


The argument from the proof of Theorem 1.1.6, which we will repeat here without 
translations by a multiple of —x, will now provide a contradiction, giving a non- 
zero section 0 € H°(dy/x + H’,k A) which vanishes on dyx + H’ = (x + V)+ 
...(x + V) with dy summands. Note that if dy, = 1, then V = x + A’ with 
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x € I'(S). Since V is Seshadri exceptional for A relative to "(S') we have 


deg ,(x + H’) 
|P(S)N (x + A’)| 


e((S), A) = 


and this violates (22) for S >> 0. Thus we may assume that dy, > 2. 


Suppose 0 4 s € H® (2x + H',kA® a) such a section exists, 


for k >> 0, by (22). We claim that for each y € '(S)/M H’ the section s vanishes 
on 2x + y+V. If not, then, since 2x + y+ ((S)N A’) C 2x + (12S) A’), 
we have 


seH (2x ty + AKA QD). 
By hypothesis V is Seshadri exceptional for A relative to "(S), and so 2x + y+V 
is Seshadri exceptional for A|2x + y + H’ relative to 2x + y+ ('(S) 1 A’). In 
particular, s|2x + y + V = 0. The same argument will show that all derivatives 
of s up to order ke(I"(S), A) + 1 vanish along 2x + I'(S) + V. Since the order 
of vanishing of s along 2x + '(S) + V is at least ke("(S), A) + 1 we conclude, 
using again the fact that V is Seshadri exceptional for A relative to "(S), that s 
vanishes along 2x + V + V. This argument extends to show that if0 # o € 


H° (dix + H’,kA® faa then o vanishes along V +... +V 


with dq, summands of V. By hypothesis, however, V +...+V = da/x + H’ and 
thus o vanishes identically on dy/x + H’. This is a contradiction, violating (22), 


and this establishes Theorem 1.5.1. The same proof can be copied line for line with 
I(S) in place of (SS). 


We now consider multiplicity estimates. There are two cases to consider, 
according to whether or not the /’-exceptional subgroup H is a proper subgroup 
of G. When H = G this means that the points in (S) for S large are 
well-distributed, and consequently one expects a very strong upper bound on the 
maximum multiplicity a section 0 € H°(X,kA) can have along I'(S). 


Theorem 1.5.2. Suppose X is an equivariant compactification of a commutative 
group variety G. Let I C G be a finite set and assume that G is the I’ -exceptional 
subgroup. Then for S > 0 


=. \r(ds)|°7 d-l 
A ae 
m(I(S), es ( iI'(S)| ( \r'(S)| 


Proof of Theorem 1.5.2. This is a straightforward consequence of Theorem 1.5.1. 
Indeed, Theorem 1.5.1 implies that 


=): . 


1 
e(I"(S), A) = a (a 


494 M. Nakamaye 


On the other hand we always have, by the multipoint analogue of (10), 
m(I'(S), A)e(I(S), A)? "|P(S)| < deg 4(X). 


Thus 
deg ,(X) 
[T(S)le(T(S), Aye 


m(I'(S), A) < 


Plugging in the lower bound for e(1°(S), A) from Theorem 1.5.1 gives Theorem 
La. 
Note that the first part of the right-hand side of Theorem 1.5.2 


ea) 
[P(S)| 
represents the minimal value possible for m(J"(S), A). The second term is pre- 


sumably unnecessary, while the third term, though certainly not optimal, is likely 
necessary in some form. 


We will next study the case where the J"-exceptional subgroup H is a proper 
subgroup of G so that the ultimate goal is Conjecture 1.1.9. As is normally the case 
with multiplicity estimates, one falls rather short of Conjecture 1.1.9 as, although it 
is easy to establish, using Theorem 1.5.2 for example, that there are many translates 
of subgroups in a given base locus, it is very difficult to show that these are not 
properly contained in some other base components. 

We will outline, following [NR], what can be done toward improving the classical 
multiplicity esimates when we assume that S is large and that J” is not contained 
inside the torsion subgroup of G. We begin with the key tool that will be used to 
enforce vanishing along subgroups and their translates. 

Lemma 1.5.3. Suppose H C G is the I’-exceptional subgroup. Suppose | >> € > 
0 is fixed and let 


—v(P.H)+e 


o<€H® (x, kA @ TE ) 
Then o|H = 0 for S > 0. 
Proof of Lemma 1.5.3. We must show that, given € > 0, for S sufficiently large 
m(I(S), A|H) < SVCD Te, 
By Theorem 1.5.2 for all S > 0, 


dy—| 
dy 


deg,(X) \77 [|r(ds)n HI dy 
m(I(S), A|H) < (a) Tranny | 
oo 2G 7, 2) 


SYA) 
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where the constant c(G, I”) does not depend on S. If 
0 Ego Ayte 
oe H(X,kA@ Ty) 
as in the statement of Lemma 1.5.3 and o|H +# 0, then m(I(S), A|H) > 


SLHD+€ which is impossible for § >> 0 according to (23). 


Lemma 1.5.3 also holds with ¢*(A) in place of A since the proof depends only 
on the numerical equivalence class of A. Suppose 


040€H° (x, kA @ TH) 


with 0 <e <1 as in Lemma 1.5.3. According to Lemma 1.5.3 and the differentia- 
tion process of [NR], Proposition 3.2, o vanishes to order at least k(S~’°42)~« — 
S~CHD+©) along H where H = H;. Choose 0 < B <_ 1 and suppose 
xe. (cS?) for c = codim(H, G). Note that for each y € (|S — cS? |) we have 
x+y € IS). Using the fact that m(I"(S), A) does not depend on the numerical 
equivalence class of A, we have for x € I (cS B ) 


m(I'(S), Alx + H) = m(I'(S), t*,(A)|x + A) 
<m(l (x + |S —cS?]) ,t%,(A)|x + H) 
=m(P ([S—cS*]), A|H). 
Thus 
olx + H =0 if m(I (|S —cS*]), A|H) < SOOM TE, 


This is deduced from Theorem 1.5.2 just as Lemma 1.5.3. Provided S > 0 then, 
using [NR] Proposition 3.2, o vanishes on x + H for all x € (cS?) to order at 
least 


k(S v(I,H2) €_§ acer 


Running the normal argument for zero estimates, as in [NR] §4, shows that there 
exists a subvariety V C X so that (LS B J) + V can be cut out by derivatives and 
translates of o, each having order of vanishing at least 


k(S v(1\H2) €_g MEH ye) 


Cc 


along I" (LS? J) + V. The same is true for I” (LS?) + Hy where Hy is the 
connected component of the stabilizer of V containing the origin. Thus we find 
an inequality of the form 


acard (7 (LS?) + Hy/Hy) deg,(Hy) < deg,(X), (24) 
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. k(S v(I,H2) €_§ v(.A1)+ ©) . : 
with a = - . Moreover, we claim that Hy > H. Since any 


c 
section of kA vanishing to order at least 


k(S v(1 2) €_g nas | 


c 


along I" (LS? ]) + y vanishes along H + yj, it is sufficient to apply Lemma 1.5.3 to 
I'(|S*|) + x for each x € V. Thus H C Ay, as desired. 

In order to apply this argument to force vanishing along H; and its translates 
for i > 2 there is a problem as one does not necessarily control m(I"(S), A|H;) 
in this case: indeed, fori = 1 we have Lemma 1.5.3 but this requires that H be 
the [°-exceptional subgroup. On the other hand, one can certainly apply the same 
argument with m(I"(S), A) — € in place of S~’“"#2)~€, The conclusion would then 
be that there is a subgroup H’, containing H, so that I” (LS? J) + H’ can be cut out 
by translates and derivatives of o and so 


poodim(’.G) card (r ([S°]) + H’/H’) deg ,(H’) < deg ,(X), (25) 


with B = mrs) Ae Formulas (24) and (25) fall short of Conjecture 1.1.9 in two 
respects. First is the superfluous factor of involving the dimension of X on the left- 
hand side which is inherent to all of the arguments. Secondly, the subgroup in (25) 
can not necessarily be identified with one of the obstruction subgroups H;. 


1.6 In Memoriam 


“Hi this is Serge Lang. So what are you doing?” These were the first words ever 
spoken to me by Serge, with a thick French accent, on the phone during the winter 
of 1990. I was applying to graduate schools and had written to Serge, whom 
I knew about through his many books, to inquire about the program at Yale. I 
never could have imagined the consequences of the small, innocent letter I sent 
to Serge. Indeed, as I would later learn, one of Serge’s most remarkable qualities 
was that he did not recognize the legitimacy of any form of “innocent” interchange. 
Every conversation, every statement, engages us in a commitment to some world 
view, to some normative judgment. Serge demanded that we recognize this density 
of discourse and consequently take full responsibility for all implications and 
presuppositions implied in the statements and claims that we make. 

While at other prestigious institutions I was readily dismissed (after all, how 
could an individual from a small liberal arts college with a degree in French brazenly 
pretend to do serious mathematics?), Serge made every effort humanly possible to 
stimulate and nourish my love for mathematics. He sent books and called several 
times each week to answer questions and help to guide my studies. Serge even 
lobbied on my behalf, to no avail, at the NSF as well as a few top graduate programs. 
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Once I decided to begin graduate school at Yale, Serge paid out of his own pocket to 
bring me down to Berkeley for the summer so that I could get started on my Ph.D. 
work. 

Shortly after my arrival in Berkeley, I was given a preliminary version of 
Faltings’ paper [F1] proving a generalization of the Mordell Conjecture. My initial 
project was to generalize this result to all subvarieties of abelian varieties, as was 
to be accomplished by Faltings [F2], and eventually to what Serge so fondly called 
pseudocanonical varieties. This did not seem like a very reasonable project at the 
time but, not knowing any better, I studied Faltings’ paper [F1] closely and set to 
work trying to generalize it to the case of arbitrary subvarieties of abelian varieties. 
Being more of a geometer than a number theorist, I tried to solve the problem with 
more sophistocated methods from higher dimensional geometry, not realizing that 
the necessary tools did not exist (or required much greater ingenuity than I was 
capable of), and so began the road toward becoming an algebraic geometer. 

Modern algebraic geometry was not Serge’s area of expertise and, caring first 
and foremost about the development of his students, he sent me away to Berkeley 
for my third year of graduate school, in order to profit from a year long program in 
algebraic geometry at MSRI, and then to Harvard for my fourth and final year. Thus 
it is that I spent a grand total of a mere 13 months in New Haven and saw Serge only 
three times after graduating in 1994. Yet the impact of the few moments I spent with 
Serge has been immense. 

For me, Serge was first and foremost the only person who ever truly believed in 
my mathematical abilities and consequently he always encouraged me to set lofty 
goals. Perhaps even more importantly in practice, he will always be a professional 
role model for me. Serge Lang was the embodiment of integrity, in academics, and 
more generally in life. He did not, like so many others in the same position, make 
judgments based on external trappings: Serge always looked beneath the surface 
and supported those whose talent he could sense, regardless of their particular 
circumstances. 

It is my sincerest hope that, in memory of Serge and the ideals he stood for, those 
in positions of influence and authority will exercise their power honestly and without 
prejudice to help all those who desire to contribute to the growth of mathematics. 
When education and knowledge become the monopoly of an elite class, the entire 
world, academic and otherwise, suffers and it should be a moral imperative for every 
teacher to reach out to the larger community without discrimination. 

Though in recent years I have largely withdrawn from the mathematical commu- 
nity, I have not abandoned my dream. One day in January 1997 I received a call 
from Serge. After his usual greeting, “Hi, it’s me,’ I told him that I had just been 
through surgery on a catastrophic brain tumor. Serge’s reply: “So can you still do 
math?” I did not have the heart to tell Serge the truth, namely that it did not matter 
and had not mattered since the first day I got up in front of a wonderful group of 
Math 122 students at Harvard. At that moment I had discovered my most fulfilling 
passion, namely teaching mathematics. I never told Serge, as I felt he would have 
been disappointed since he had such great hopes for my mathematical career and 
since he devoted so much time and energy to helping me. 
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Perhaps I was mistaken as Serge undoubtedly would have appreciated my desire 
to help others as opposed to pursuing greater success as a mathematician. On the 
other hand, Serge would certainly have chastised me for “chickening out” and with 
reason as I have assuredly not put my heart and soul into doing mathematics. In the 
end, there are simply too many other beautiful things in the world which I am not 
willing to sacrifice in order to prove better theorems. My admiration, however, for 
those who are willing to sacrifice is in no way lessened, particularly for someone 
like Serge who was always fully conscious of his intentions, desires, and needs, and 
was still willing to put everything aside in order to develop his mathematical vision. 
Having enjoyed a life of passionate pursuit of truth and equity, I hope that Serge 
rests now in peace, admiring the immense beauty of the mathematical world which 
he loved so much. 
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Sampling spaces and arithmetic dimension 


Catherine O’ Neil 


To Serge. It was an honor to be your favorite sophomore. 


Abstract This paper introduces the twin concepts of sampling spaces and arith- 
metic dimension, which together address the question of how to count the number, 
or measure the size of, families of objects over a number field or global field. It 
can be seen as an alternative to coarse moduli schemes, with more attention to 
the arithmetic properties of the ambient base field, and which leads to concrete 
algorithmic applications and natural height functions. It is compared to the definition 
of essential dimension. 


Key words essential dimension ¢ coarse moduli elliptic curves 


Mathematics Subject Classification (2010): 11G35 


1 Introduction 


This paper is the result of fomalizing a concept which was introduced to the 
author by her thesis advisor, Barry Mazur. Namely, it is the idea of a kind of 
parameter space which includes every example of a certain arithmetic type, and 
which can be studied in its own right. This leads directly to the definition of a 
“sampling space.” Here are a few motivations for this definition. First, one wishes to 
formalize and standardize the many discussions surrounding the ‘average behavior’ 
of arithmetic objects. Second, one wishes to find comparison theorems of various 
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different families of arithmetic objects. Third, one can ask for efficiency in these 
families; this question leads to the definition of arithmetic dimension. 

When dealing with classes of objects such as Galois extensions, elliptic curves, 
class numbers, or some kind of cohomology classes over a number field, one often 
feels somewhat stuck between a geometric point of view and a discrete point of view, 
owing to the inherent arithmetic. One may want to ask for the growth of a function 
related to a kind of complexity, such as the rank of a Mordell—Weil group or the size 
of a class group. However, the answer may well depend on how you count. What 
is inherent in the discussion is the concept of listing all examples of a certain type, 
usually with some equivalence relation in effect, and taking averages. However, the 
way you list the examples matters. For example, in listing elliptic curves over Q (up 
to isomorphism) you may list all curves up to a given discriminant, or you may list 
all curves with coefficients smaller than a given size. Each of these is natural but the 
answer you get will typically depend on which one you chose. 

Comparison theorems are intended to help us sort out the issue raised above of 
counting the same things in different ways. We shall see that many sampling spaces 
have natural height functions, as they are sometimes projective or quasi-projective 
schemes. Because of this, if we have two different sampling spaces for the same 
arithmetic objects, we can directly compare their heights. Since we are interested 
ultimately in listing all objects of a certain kind, then it obviously in our interest 
to make sure we have as little redundancy as possible. That is why we would like 
our sampling spaces to have minimal dimension. Indeed this can lead us to far more 
interesting and nuanced sampling spaces, as we will see in Section 5. 

We are primarily interested in functors which are not representable. One standard 
way around this problem is to introduce stacks. Although many of the same issues 
will arise as in the theory of stacks (such as group actions), one can think of sampling 
spaces as being much more algorithmically motivated. Indeed, sampling spaces are 
intended to be concrete and, when possible, explicit. 

One way to view a sampling space for a non-representable functor H is as 
an alternative to a coarse moduli scheme. In some sense, sampling spaces trade 
efficiency for completeness. A sampling space lives above its functor (i.e., maps to 
7) and loses no arithmetic information. By contrast, a coarse moduli scheme lives 
below its functor and does lose arithmetic information. The arithmetic dimension 
of a functor is the minimal dimension of a sampling space for that functor. The 
difference between the arithmetic dimension of H and the dimension of a coarse 
moduli scheme for H, if both exist, can be viewed as an arithmetic “bloating factor,” 
i.e., the number of extra parameters that one needs to, say, program a computer to 
completely list all arithmetic objects of a given type over a number field versus over 
the complex numbers. A good example is given by the functor €//, which associates 
to a base scheme S in the category Cx of finite-type schemes over a field K, the 
elliptic curves over S up to isomorphism. The coarse moduli scheme for €// is the 
j-line, a curve. However, to actually list all elliptic curves (up to isomorphism) over 
Q, for example, one needs both the j-invariant and a separate parameter to take 
into account all the quadratic twists of a fixed elliptic curve (see the example in the 
next section). It is not hard to see that this forces any sampling space for E// with 
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respect to the field Q to have dimension at least 2. Therefore there is one dimension 
of arithmetic bloat for elliptic curves over Q. 

This is, as described above, not a new idea. There have been many interesting 
definitions which try to measure the complexity of a functor based on the dimension 
of a parameter space for the objects involved. However, the definition we give below 
is new, in that it is more specific to the arithmetic of the base field. In particular, 
in Section 3.1 we compare our definition with that of the ‘essential dimension’ 
of a group, and we show that the two are measuring different phenomena; in 
particular, our definition is sensitive to the existence of rational points on quadratic 
hypersurfaces in large projective spaces over number fields. 


Acknowledgements The author would like to acknowledge the many conversations with Mira 
Bernstein which led to the definitions contained in this paper and conversations with Barry Mazur 
which motivated the investigation of these ideas. 


2 Definitions 


Let K bea field. Let Cx be the category of schemes of finite-type over K. 


Definition 2.1. An arithmetic object is a contravariant functor F from Cx to the 
category of sets. 


Definition 2.2. A sampling space for an arithmetic object F is the data of a scheme 
T in Cx with a natural transformation ® from the functor Home, (-,T) to F which 
is required to be surjective on L-points whenever L is a finite extension of K. 


Remark. We denote by ® : T —» F the fact that T is a sampling space for F via 
the map ®. When the map © is obvious we simply write T —> F. 


Definition 2.3. The arithmetic dimension of an arithmetic object is the minimum 
dimension of a sampling space for that object. 


Example. We will prove that the functor €// over a number field has arithmetic 
dimension 2 and over a finite field has arithmetic dimension at most 2. When K 
is a field of characteristic away from 2 and 3, every elliptic curve over K can be 
modelled (non-uniquely) by a Weierstrass equation of the form E : y? = x? + 
Ax + B, where A, B € K and where we assume A = 27B? + 4A? # 0. In the case 
that K has characteristic 3, when E is ordinary then one can always write E : y? = 
x3 + Ax?+ B; we can deal with the supersingular case separately with a finite union 
of points. Similarly, when K has characteristic 2 and when F is ordinary, one can 
write E : y?+xy = x?+Ax?4+B.! Therefore Spec(K[A, B, 1/A]) —> €11, and so 
the arithmetic dimension of €// is bounded above by 2. We will now show that there 
is no 1-dimensional sampling space for €// over a number field. Suppose we have a 
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curve sampling space 7; then T will map (by composition with the map €// > P\) 
to the j-line: T —» E11. The size of a fiber of this map is bounded on the one hand 
by the degree d of this map of curves. On the other hand, above a given j -invariant 
away from j = 0 or 1728, the fiber above a K-rational j-value corresponds to the 
set of all quadratic twists of a single elliptic curve with that j-invariant, i.e., with 
the set H'(K,{+1}) = K*/K*?. By the functoriality condition this would imply 
that for all finite separable extensions L/ K the set L* /L*? is bounded (by d). This 
is clearly untrue for number fields. 

As stated in the introduction, the above definitions allow us to make precise a 
measurement of the “dimension of pure arithmetic” of the functor €//, namely 
the discrepancy between the arithmetic dimension of €// and that of its coarse 
moduli scheme, 1. Also, the example of elliptic curves nicely illustrates some of 
the following more general observations: 


Remarks. (1) When H is representable by ascheme X inCx, then X is a sampling 
space for H. Moreover, the arithmetic dimension of H is just the dimension of 
X: any sampling space Y of H comes with a morphism ¢ : Y — X which 
is surjective on K points. Since the closure of the image of # has dimension at 
most dim(Y ), and since the image has dimension equal to that of the dimension 
of X we know dim(Y) > dim(X). 

A sampling space T of F comes with a tautological object: namely, take the 
image of the identity map Idy € T(T) to the corresponding T-valued point 
of F. 

If there is a scheme “below the functor’, then the arithmetic dimension of 
the functor is bounded below by the dimension of that scheme. For example 
if there is a finite-type K-scheme M and a transformation of functors F — 
Home, (-, M@) whose values on L-points [L : K] < oo form a Zariski dense 
collection of points in M, then the arithmetic dimension of F is at least the 
dimension of M. This will typically be the case if M is a coarse moduli scheme 
of F. 

Say we have a “arithmetic surjection” from an object F to an object H; this just 
means that there is a transformation of functors which is surjective on L-points 
whenever L is a finite extension of K. Then the arithmetic dimension of H is 
bounded above by the arithmetic dimension of F. 

(5) We do need to assume the surjectivity for all finite field extensions. For 
example, in [7], it is proven that for any finite field F there exists a curve over F 
which is “space-filling,” i.e., which has as many F-valued points as its ambient 
space Aj. for n > 1. However, it is impossible to produce a curve which is 
a sampling space for the functor associated to Aj, since the number of points 
in affine n-space over a finite field grows asymptotically much faster than the 
number of points of a curve. 

In particular, an infinite disjoint union of copies of Spec(K) has no arithmetic 
dimension. 
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3 Important example: cohomology groups 


Let G be a group scheme over K. Let Hj-(G) denote the functor taking a scheme 
X of finite type over K to the pointed set (which under certain restriction on G 
will have a natural group structure) Ay(X ,G). When X is connected and G is a 
finite étale group scheme we can identify elements of this set with G-torsors over 
X, i.e., Galois coverings Y — X with an isomorphism G = Auty(Y). When 
X = Spec(L) for L a separable extension of K, this is just Galois cohomology and 
is denoted by H!(L,G). 


Lemma 3.1. A sampling space T/K of Hi(G) has a “tautological” G-torsor 
Y — T. Moreover, whenever L/K is an algebraic extension, the surjective map 
®, : T(L) > H'(L,G) is given by pullback as follows, for p € T(L) thought of 
as amorphism p : Spec(L) — T mapping to the G-torsor By (p) : Yp — Spec(L): 

Y, —>Y 

P(p)) Of. 

Spec(L) ae 


Proof. By Remark 2 above, we take the image under this map of the identity map 
on T to get the G-torsor Y — T. Next, given p : Spec(L) — T we get a map 
T(T) — T(L) by composition: a : T — T maps to a o p (note that idr map to p 
itself). The above diagram then follows from functoriality and the fact that the map 
H}(T,G) — H'(L, G) is defined by pullback. Oo 


3.1 Comparison with essential dimension 


We refer to the many papers of Zinovy 

Reichstein available at http://www.math.ubc.ca/$\sim$reichst/pub.html. The es- 
sential dimension of a finite group G with respect to a field K is defined in [2] as the 
least dimension of a K-variety Y which is the target of a dominant G-equivariant 
rational map of a faithful linear representation of G. This definition is generalized 
in [13] to algebraic groups and in [1] is given a functorial perspective. Namely it can 
be shown using the results in that paper that the following definition is equivalent 
(at least when K is infinite). 


Definition 3.2. Let X — Y be a morphism in Cx such that X is a G-torsor over 
Y. This gives a transformation of functors Hom(-,Y) + H'(-,G). The essential 
dimension of G over K is the least dimension of an Y as above such that the map 
Hom(Spec(K’), Y) > H'!(K’', G) is surjective for all field extensions K'/K. 


The above immediately implies: 


Theorem 3.3. The essential dimension of G is an upper bound for the arithmetic 
dimension of He (G). 
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Remark. The essential dimension of G over K is sometimes unequal to the 
arithmetic dimension of H;,-(G). 


Remark. The above theorem is obvious when K = K, since the arithmetic 
dimension of : (G) is zero if K = K and G is finite. We show that even over 
number fields these two concepts are essentially different. In fact we show the 
following: 


Theorem 3.4. Let K be a totally imaginary number field and let n > 4; the 
essential dimension of (43, K) is 5 but the arithmetic dimension of Hy (3) is at 
most 4. 


Sketch of proof. We want to show that there is a four-dimensional scheme which 
maps onto (K*/K*?)°. So take (a;)\<i<s € (K*/K**)°. Form the equation 
psn a;x? = 0. This always has a point over K since we assumed K to be a totally 
imaginary number field. But then we can basically take the 4-dimensional sampling 


space to be 
5 


Spec(K[X;, X;"h<i<s/() aix; = 0). 
i=l 
To be complete, we would need to take into consideration the fact that the solution 
to ae a;x? = 0 may have some x; = 0. This will add a finite number of 
4-dimensional components to our sampling space. 


Question. What is the actual arithmetic dimension of H;-(3) for a number field 
K? How does the arithmetic dimension of H;- (3) grow as a function of n? Note 
that when the base field K has a trivial Brauer group the arithmetic dimension of 
file Guz) is probably much smaller than d; see the section below on Brauer groups 
for the relationship and the section below on finite fields for such an example. 


3.2 Brauer groups 


As a special case of the above, when K is a number field, we have an identification 
Hj.(PGL,) & Br(K)[n], where Br(K)[n] is the n-torsion of the Brauer group of 
the field K. Consider the functor on Cx givenby X  H fs (X, Gin)[n] (the n-torsion 
in the cohomological Brauer group of X). We will denote this functor by Brx [”]. 


Theorem 3.5. Let K be a number field which contains the nth roots of unity. Then 
the arithmetic dimension of Brx |n] is 2. 


Proof. The arithmetic dimension of Brx|n] is at most two, since every division 
algebra D of order n in the Brauer group of a number field containing a primitive 
nth root of 1 is cyclic. Namely, it can be written as D X (x, y|x” = a,y" = 
b,xyx~!y7! = C). Therefore G,, x G,, is a sampling space for Brx[n]. Next, say 
there is a 1-dimensional finite type K-scheme C which is a sampling space for 


Brx|n]. Denote by & the corresponding tautological object in Br(C)[n]. This Brauer 
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group element will manifest every cyclic algebra over any finite extension L of 
K as some fiber, since it is the tautological object of a sampling space. However, 
Tsen’s theorem implies (with a bit of work) that the Brauer group of the base change 
C Xgpec(K) Spec(K) is trivial. This means there is some finite extension L of K 
over which € becomes trivial. It is clear that there doesnot exist such an L which 
simultaneously trivializes all cyclic algebras. Oo 


Corollary 3.6. Let K be a number field containing the nth roots of unity. Then the 
arithmetic dimension of Hj (12) is 2. 


Proof. First, Gm X Gm is clearly a sampling space for H;-(7) over any field. 
Next, use remark 4 on page 502 above and the fact that the “norm symbol map” 
Hy (uz) > H?(K, Un) = Br(K)[n] is surjective. Oo 


Notice that the above proof is specific to number fields- in general, an Azumaya 
algebra is only a finite product of cyclic algebras even if we assume roots of unity. 


Question. What if I don’t assume any roots of unity? By Observation (3) below, 
since adjoining a pth root of unity to a field induces a prime-to- p extension, there 
is an upper bound on the arithmetic dimension of the functor Brg[p] of 2(p — 1). 
Can this be improved? 


Observations. 


1. There is no sampling space for the entire Brauer group over a number field. 
Namely, a given element of the Brauer group (cohomological or not) of a finite- 
type scheme over a field has a given (finite) order. But elements of the Brauer 
groups of number fields have arbitrary order. 

2. When G is a finite group scheme over K of degree 7 the arithmetic dimension 
of Hj-(G) is bounded above by n. Namely, the regular representation (which is 
the action of G on I'(G, Og)) is faithful, and then the results of [2] imply the 
essential dimension of the group scheme G over K is bounded by its dimension, 
which is n. Hence also the arithmetic dimension is bounded by n. 

3. Let G be a finite commutative group scheme of order n over K. Let K’/K be 
a finite separable extension of degree d prime to n. For any finite extension 
L/K we get a sequence H'(L, G) = H'(L @x K',G) = H'(L,G) whose 
composition is multiplication by d. On the other hand, suppose that 7’ is a finite 
type K’-scheme with a transformation Home,,(-,7’) - Hx(Gxr) which is 
a sampling space over K’. Set T = Resx//x(T’). For a finite field extension 
K C L consider the composition 


cor 


T(L) = T'(L @x K’) > H'(L @x K',G) > H'(L,G). 


By what was said above and since (d,n) = | this is surjective. Hence we see that 
T is a sampling space for H;-(G), provided we can show these maps fit together 
and come from a transformation of functors Hom(-, 7) + H;(G). Granted that 
this is indeed the case we see that dim(T) = ddim(T’). In other words we 
see that the arithmetic dimension of Hj.(G) is at most d times the arithmetic 
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dimension of H (Ge x’). To construct the transformation Hom(-, 7) > H;-(G) 
we have to construct a G-torsor over T. Let X’ —> T”’ be the G-torsor 
corresponding to the transformation Hom(-,7’) — H ahe. x’). Consider the 
morphism Resx/x(X’) > T = Resx/x(T’). This is a Resx7/x (Gx-)-torsor. 
Since G is abelian there is a “norm” map Resx’/x(Gx’) — G which is a 
homomorphism of group schemes over K. The push-out of Res x/x(X’) by this 
norm map gives the desired torsor. 

4. If G is a group scheme and if we can embed G into an “H!-trivial group scheme” 
H (such as GL, or SL,), then we have 0 > G > H — H/G — 0. There is 
no guarantee that the scheme H/G is a group, but it still makes sense to take 
the long exact sequence of cohomology to the extent that we get 0 > G(K) > 
H(K) — (H/G)(K) > H'(K,G) > H'(K,H) = 0. That is, we get the 
sampling space H/G. For example, if G = ,, we can take H = G,, to recover 
sampling space G,, for H . (Ln). 


4 Over a finite field 


Let p be an odd prime, let IF, be the finite field with p elements, and let d be a 
positive integer. For i ranging between 0 and d, define the map f; : F,\{-i} > 
Fy / F*, sending the element x to [x +7]. Katz proves (Corollary 1.4.2.2, page 25 of 
[8]) that as p goes to infinity, the maps f; have “equidistributed” and independent 
values. In other words, the map (fo, fi...., fa) : Fp\{0,—-1,-2,...,-d} —> 
(F*/F*?)4+1 will asymptotically send x to any given element of Z/2Z¢*! with 
equal likelihood. 

The following theorem says nothing about equidistribution, but it generalises 
Katz’s result in that it works modulo nth powers, and also because the conditions on 
the functions are very weak. 


Theorem 4.1. Let F be a finite field of characteristic p. For an integer n prime to p, 
the arithmetic dimension of Ai (uw) is 1. In other words, there is a curve sampling 
space defined over F which parameterizes d “independent” nonzero elements of F 
mod nth powers. 


Lemma 4.2. Let fi, f2,..., fa € Flt] be functions with the property that the ring 
REPO Yer alOT j= tak = fi 3 = fo Crk 
is actually a field. Define the set S = {a € F| f;(@) = 0 for some 1 <i < d}. The 


curve A'\S is a scheme defined over F, and becomes a sampling space for H} (uw) 
whenever L is a sufficiently large field extension of F. 


Proof. Fix fj as above, and for any field extension L/F, define the map 


AL\S — (L*/L*")4 
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sending a@ to the element ([/f\(@)],..., | fa(@)]). This will be a sampling space if, for 
an arbitrary element ([a;], [a2],....[aa]) € (L*/L*”)¢, there is an a and there are 
yi, 1 <i < d, such that for alli,a;y? = f;(a). These equations define the function 
field of a curve C(ja;},{a9),....{az]) Which is a cover, say by the map z, of P! given by 


sending the pair (a, (1, y2,..., ¥a)) to a. Moreover, since the ring R above is a 
field, this curve is irreducible (a geometric condition, and so we can assume that 
all of the a; are perfect nth powers). In order to get a sampling space, we need to 
show that the curve C({q,},{a9],....{ag]) has an L-rational point outside the fibers above 
S whenever L is large enough. We will make use of the Riemann Hypothesis for 
curves over finite fields: 


#C®)=g+1-) a, 


where the a; are the eigenvalues of the Frobenius operator. The absolute value of 
each @; is bounded by ,/q, and the number of eigenvalues is 2g, where g is the 
genus of C. Moreover, the genus of C(ja,},Ja9).....laq]) depends only on the degrees of 
the f;’s and the degree, n@, of the map z. Finally, the number of rational points 
“hiding” above points in S is bounded again by the degrees of the f; and by n and 
d. Thus when the size of L is large enough, we are assured of a choice of a and y; 
as above. Oo 


We can view the above results as saying that over number fields, we have a 
stronger version of Kummer theory, and over finite fields we have a very strong 
version of Kummer theory. 


5 Models of genus one curves 


Let n > 3 be an odd integer, and let K be a field of characteristic prime to n. For 
an elliptic curve E with origin Og defined over K, fix a “base diagram” fz : E > 
P”~! given by the full linear series associated to the line bundle n - Og. A choice 
of fg induces, by passing to automorphism group schemes (see [11]), the following 
commutative diagram of group schemes, where each the row is exact and the upper 
row is a theta group as defined in [9], page 221: 


0+ Gn > Gr -> E[n] > 0 


1 1 1 (5.1) 
0 > G, — GL, — PGL, — 0. 


Definition 5.1. Define the map Ob : Hj (E[n]) > Hj(PGL,») > Hz (Gm) by 
composing two induced maps on cohomology from diagram (5.1) above. 


Definition 5.2. The functor Hoy is the set-theoretic kernel of the map Ob. 
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Remarks. (1) An element of Hop,(K) can be seen to correspond to a certain kind 
of twist of the base diagram f, namely a diagram “of index n,” C > P""! 
where C is a genus-one curve whose Jacobian is isomorphic to E. A general 
element of H'(K, E[n]), not necessarily lying in Hoy(K), would be a more 
general twist, namely a diagram “of period,’ C — S, where S is a Brauer— 
Severi variety of dimension n — 1. Moreover, every index-n object will be 
represented in Hop. 

(2) For example, when n is 3, the elements in Hop correspond to all cubic curves 
(up to PGL3-action) whose Jacobian elliptic curve is isomorphic to EF. 

(3) The map Ob is quadratic, so Hop (X) is not generally a group. 

(4) All of the elements of the n-Selmer group for E will appear in Hop. For this 
reason we are interested in finding small sampling spaces for Hop; it would be 
the first step towards finding a sampling space for the Selmer group of F. 

(5) Assume that 1 is odd and the characteristic of K is prime to n. Assume that 
E is an elliptic curve over K with E[n](K) C E(K), ie., as group schemes, 
E[n] = Z/nZ x Z/nZ. Then Hop C Hy(E[n]) C Hy (Z/nZ x Z/nZ). In 
[12], it is proved that the arithmetic dimension of Hop is at most 1, and in the 
cases of n = 3 andn = 5 an explicit sampling space of dimension n has been 
found for this situation (see also Corollary 5.12 below for an improvement when 
n = 2). The image of the map Ho, > Hj (Z/nZ) x Hj.(Z/nZ) is the set of 
trivial norm symbols of level n. 


5.1 The general case: lower bound 


Theorem 5.3. Let K be a number field, n > 2 an integer, and E an elliptic curve. 
The arithmetic dimension of Hop» is at least 2. 


Proof. First we reduce to the case that we have “rational n-torsion,” i.e., that E [n] & 
Z/[nZ x Z/nZ : this is so after a finite extension, and any 1-dimensional sampling 
space for Hop, would base change to a sampling space for the rational n-torsion 
case, so it is enough to show that no such sampling space exists. We will make use 
of the fact that in the case of rational n-torsion, Ob is a norm symbol map. 


Lemma 5.4. Let K be a number field. Given a nontrivial element a € K*/K*", 
the set of b € K*/K*" such that the corresponding norm symbol (a, b) is trivial is 
the infinite set N xc xan (K(V/a)*)/K*". 


Proof of lemma. That the set described in the lemma is as claimed is the definition 
of the norm symbol. That it is infinite follows from local considerations. oO 


Now assume that we have a one-dimensional sampling space T for Hop». It 
consists of a finite union of schemes of dimension 0 and 1. By the pigeonhole 
principle and the above lemma, there exists some irreducible component of 7, a 
curve C, such that the rational points of C are responsible for many of the trivial 


Sampling spaces and arithmetic dimension 509 


norm symbols, i.e., that for infinitely many classes a € K*/K*" and infinitely many 
bENg vay KK (4/a)*)/ K*" there exist rational points x(q») € C(K) mapping to 
their respective points in Hop.n. 

The tautological object of T is an element of AA (T, [Ln X [dn), and as such gives 
rise tO a [Ly X [Ly field extension L(T)/K(T), where K(T) is the function field of T. 
Similarly we get a field extension L(C)/K(C). Since ¢, € K, by Kummer theory 
there exist elements d, and d2 € K(C) whose nth roots give the extension L(C). 
Moreover, the functions d; define maps f of C onto G,,. We may now construct 


the map C > Gy, x Gp, = Gm where the second part is the second projection map. 
But by construction, the composite map is both nontrivial and has infinite fibers, so 
C cannot be a curve of finite type, a contradiction. Oo 


5.2. The general case: upper bound 


As above, fix a “base diagram,” an embedding f : E — P”~!. Then the finite 
group scheme E[n] acts on E and extends to a faithful action on P”~!; in other 
words, we get an injective map of group schemes i : E[n] — PGL,, and we 
can define the scheme P”~!/E[n]. This will not be smooth, because although 
the E[n] action is faithful, there are points with nontrivial stabilizer. Denote by 
P”"—!/E[n]™ the smooth (open) part of this quotient. A lift > ¢ P"~!(K) of a point 
v € P™!/E[n}™(K) is “almost rational,” meaning that for each o, there exists 
T(o) € E[n](K) such that 7 = i(T(c))(%). In this way we can form a cocycle 
&, for any such v, and we can map &, to H!(K, E[n]). Moreover, a different choice 
of a lift of v will differ by a coboundary. Thus we have defined a map of functors 
&: (P"1/E[n))™ > Hop. 


Theorem 5.5. The scheme (P"~!/E[n])™ is a sampling space for Hop, via & : 
&:(P""'/E[n])™ > Hoo. 
Moreover, the following diagram commutes: 


E => E/E[n]) = E => E/nE 


f i \ Ll . (5.2) 
pr-l + P"!/E[n] — Hon > Hy (E[n)) 


where the rightmost column comes from cohomology, sending P € E(K)/nE(K) 
to the cocycle O° — Q ifnQ = P. 


Proof. We know that E[n] C PGL,, (see diagram (5.1) above) and we can extend 
this to an exact sequence of schemes E|[n] — PGL, — PGL,,/E [n], since the action 
is fixed-point free. Note that since E[n] is not normal inside PGL,,, the quotient 
scheme is not a group scheme. However, we can still make sense of the first part of 
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an exact sequence of cohomology: 


PGL,(K) > PGL,/E|n](K) > H'(K, E{n]) 2 H'(K,PGL,). 


We immediately see that the scheme PGL,,/E|n] is a sampling space for Hop. 
We can cut down the dimension considerably by noting that the group PGL, acts 
faithfully on P’~!, so we can replace PGL, / E[n] by its compression P”~!/ E[n].? 
Concretely, any lift of a rational point of the scheme PGL, / E[n] to PGL,(K) is a 
matrix M which is almost rational, just as above: for every element 0 € Gx, we 
have M° = i(T(o))- M. The image of M in PGL,,/E[n](K) maps to the cocycle 
o +» T,. Moreover, such an M can be thought of as moving C to E in the sense 
that, starting with equations F;(x) for E, the equations for C will be F;(M x). For 
more on this perspective see [10]. The above compression amounts to the fact that 
one column of such an M will suffice to represent the cocycle o +» T,. Or indeed, if 
v is any rational point in C-space, then M -v, the image of v in E-space, will suffice. 
See the examples worked out below for an explicit representation of the matrix M 
starting only with one of its columns. 

Now for the commutativity of the diagram. The composite of the first two maps 
on the first line is multiplication by n, a rational point of E’/ E'[n] is just the class of 
a point O € E(K) such that nQ = P is rational. The third map sends the class of 
QO tonO = P, and the connecting map from cohomology on the right divides P by 
n again. On the other hand, the map E/E[n] > P"—!/E[n] is defined over K and 
sends the class of Q to some rational point of P’~!/E[n] and then associates to that 
point a cocycle which is clearly equivalent. Oo 


Corollary 5.6. The arithmetic dimension of Hop, is at most n — 1. 


Corollary 5.7. Let K be a number field. The arithmetic dimension of Hop3 is 2. 


5.3 Visualizing elements of Hop 


We take a definition from Section 3 of [5]: Let € € H!(Gx, E) for an elliptic curve 
over the number field K. Suppose we have an exact sequence of abelian varieties 
0-> E—> J — B — 0. We define € to be visible when & is in the kernel of the 
induced map H!(Gx, E) > H'(Gx, J). 

We will specialize the situation a bit. Since H'(Gx, E) is a torsion group, the 
element & has some order 1; we will assume that n is odd and at least 3. Moreover, 
assume that we have found another elliptic curve E’ such that E[n] = E’[n]. Then 
we can construct the following exact sequence: 0 > E —> (Ex E’)/E|[n|] > E' > 
0, where we have glued E and E’ together along their common subgroup scheme 
E[n] and we have identified E’ with E’/E'[n]. 


This idea of compression in the case of a faithful representation comes from the papers of 
Reichstein. 
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Then & is visualised by the abelian variety (EF x E’)/E[n] exactly when & lifts 
to a point P of E’(K) = H°(Gx, E’). This means that there is a point 0 € E'(K) 
where 7Q = P and where (0) = Q° — Q. This in turn is exactly the condition 
that the following diagram holds: 


P (a OF -O) 
0 > E'(K)/nE'(K) > H'(Gx, E'[n]) > H'(Gx, E')[n] > 0 
= 


0—> E(K)/nE(K) > H'(Gx,E[n]) ~ H'(Gx, E)[n] > 0 
(oe 0" = Oj) : 


We will abuse notation slightly by saying that in this case, & is visualised by E’. 


Remark. The above approach to visualisation was explained to the author in a talk 
by Tom Fisher at the “Explicit Arithmetic Geometry” conference in Paris, December 
2004. 

Now let E be an elliptic curve defined over K, and let X(n)* denote the twisted 
modular curve parameterizing pairs (E’/S, E[n]s = E’[n]), where E’ is an elliptic 
curve over a base S and E[n]s — E’|n] is an isomorphism of group schemes over 
S respecting the Weil pairings. Let € denote the total space of the universal elliptic 
curve over X(n)". Then E[n] acts on € ina natural way. 


Theorem 5.8. The base diagram f : E > P""! gives rise toa map E€ > P""!, 
which for n = 3 is a birational map and which for larger odd n is an embedding of 
€. Moreover, the following diagram (of functors) commutes: 


E > €E/Efn) = E > E/n€E 
| | 1 1 (5.3) 
pr-l _y P"/E|[n] — Ho, > Hy-(E[n)). 


Proof. First assume that the map € > P"7! exists and fits in the above diagram. 
Assume that there is a geometric point x in the intersection of two elliptic curves 
E’ and E” in the family of elliptic curves €. The first map on the top row above 
sends x to the intersection of E’/E[n] and E”/E|[n]; since this is essentially the 
multiplication by n map, a degree n? map, we know that the n* preimages of the 
image of x all lie on the intersection. Now we will show that two curves intersecting 
inn” points will be forced to be the same curve when n is at least 4. If there is a point 
y lying on E’ but not on E”, we can project away from that point to land in P”~? 
space, and E’ will now be a degree-n — | curve and EF” will still be a degree n curve. 
Now we project away from points lying off of E’ and E” until we get down to the 
projective plane; then the two curves will intersect in n - (n — 1) points by Bézout’s 
theorem, a contradiction. Note that this argument does not work for n = 3, and 
indeed it is well known that in this case € is birational to P?; with our construction, 
the elliptic curves in € will all intersect in their 9 flex points. 
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Now we need to show such a map exists. We sketch how to produce the morphism 
€ — P""!. Namely, let E’ be an elliptic curve over a field extension K’/ K endowed 
with an isomorphism @ : E[n]x: — E’[n] compatible with the Weil pairings. Recall 
that we have fixed the base diagram fg : E — P"~! which gives rise to an action 
of E[n] on P"~! by the associated map i : E[n] — PGL, which reproduces the 
Weil pairing via (5.1). Choosing a basis of '(E’, Og (n - O)) gives rise to a similar 
diagram fz, : E’ > P77! andi’: E’[n] + PGL,,x, reproducing the Weil pairing 
on E’|n}. We claim that there is a unique automorphism 6 : P’;' > P%;' such 
that 6 intertwines the action of E|n]x, via i and base change with the action of 
E[n]x via w and i’. Although we leave this to the reader, we point out that it is 
straightforward to prove this over the algebraic closure and then deduce the general 
case using that a solution B is unique since there are no automorphisms of P’~! 
commuting with the action 7. The upshot is that there is a unique choice of the 
embedding fz, : E’ — P%7' so that the resulting action i’ agrees with the action 
i via a. Doing this procedure over the base X(n)" gives a canonical morphism 
Eo Pe, 


Commutativity of the diagram. By construction the morphism € > P”~! is E [n]- 
equivariant, which gives us the first commutative square. The last commutative 
square is obvious. To see that the middle square commutes, consider a fibre FE’ 
of € over some K’ point of X(n)”. By (5.2) the diagram commutes with E replaced 
by E’. Hence it suffices to see that the obstruction map Ob only cares about the 
isomorphism class of the pair (E[n],e) where e is the Weil pairing. This follows 
from the results in Section 4 of [11]. oO 


Definition 5.9. Denote by Hj-(E[n])® the image of € in Hj,(E[n]). This is the 
visualised part of Hy (E[n]). 


An open question is to what extent H K(E [n])© contains the n-Selmer group of 
E. To this end we can recover one result from [5]: 


Corollary 5.10. The 3-part of the Selmer group of an elliptic curve is always 
visualized. 


Proof. In fact the above proves that all of Hop» is visualised, and we know that the 
3-torsion in the Selmer group lies inside Hop. oO 


Corollary 5.11. When n = 3, the universal elliptic curve E as defined above can 
be seen as a sampling space for Hop; it is a rational surface. 


Remark. We already knew that P?/ E[3] is a sampling space for Ho, whenn = 3. 
It is a unirational surface, and it is known to be rational over K. The above corollary 
tells us that it is in fact rational over K, at least when the modular curve X(n)* has 
a rational point, as it does here by construction, since we constructed it so that the 
point corresponding to E itself is rational. See the next section for explicit equations 
for n = 3 in the case where E[3] X Z/3Z x p13. 
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5.3.1 Example 


When n = 3 and E[3](K) = E[3](K), we will explicitly compute a rational 
parameterization of the sampling space T3 for Ho» and the tautological curve above 
it. We will also compute the generic trivial level-3 norm symbol. 

Let ¢ denote a fixed primitive third root of unity. We may standardize the action of 
the 3-torsion as follows: take a basis (S, T) of E'[3] such that S sends the coordinate 
x to x, y to Cy, and z to €7z, and such that T sends the coordinate x to y, y to 
z, and z to x. Defines = xyz,t = +y42u=xrytyr4i2ev= 
xo yr +yo4z2ox3 andy = x3 yo+y3z°+z3x°. These are clearly invariant under the 
above action, and they satisfy the relations fj : v+¥ = tu—3s7 and fo : vy = 9s°— 
6tus? + u> + t°s°. Our surface T3 is an open part of Proj(K[s,t,u,v,¥]/(/. 2), 
where s and ¢ are of weight 1, u is of weight 2, and v and ¥v are of weight 3. 

Note the above action of [3] forces E = E) to lie in the family 


By: Xk? Y? 7? =31XFZ =O 


for some A € K. Fix O = (1;-1;0). 

We will find a rational parameterization of S by identifying S' birationally with 
E as in Section 5.3. To be exact, note that if we choose a rational point of P?, we 
can (almost always) uniquely solve for A above: A = * ees ate = x ? 

Note that € also has a natural elliptic curve family in Pz sitting over it: a rational 
point P of € lies on an elliptic curve E’; over such a point P the elliptic curve is 
simply E’ embedded in P? by the line bundle corresponding to 2- O + P. This is 
not the family we are looking for; rather, we are looking for the family of genus-one 
curves all of which have Jacobian isomorphic to the fixed EF. The fact that two such 
curves lie above each point means that their corresponding cocycles are visualised 
(see below). 

From Theorem 5.8 we see that if we identify € birationally with P?, we can 
express the multiplication by 3 map on P?, denoted by [3], as a function on P?/ E [3] 
and this will give us its birational model. On the level of points, we are taking a 
rational point P of P? and finding the elliptic curve EF p it lies on as above; we then 
extract a QO on Fp such that 3- Q = P, and form the cocycle O° — Q. 

Note that even with the general Weierstrass equations for elliptic curves, this 
makes sense, but would require knowing how to extract this cube root in general. 

The function [3] vanishes on the hyperplane Z = 0 exactly at those points Q 
such that 3- Q = O,S, or —S. Similarly, [3] vanishes on the hyperplane X¥ = 0 


3 What if we can’t solve for A? This happens when one and more of X, Y, and Z is zero. Two being 
zero would mean the point is rational, which gives us the trivial cocycle. Acting by T above if 
necessary (which only changes the cocycle mod coboundaries) we can assume our point looks like 
(0; 1; w), and so the corresponding cocycle & sends o to (0; 1; o(@)), but on the other hand sends 
it to itself acted on by &(o) € E[3]. We see then that (0) = i - S for some i, so & pulls back 
to H'!(Gx,Z/3Z) = K*/K*>. However these cocycles are also covered by the points (1; @; @7), 
which have well-defined 1. 
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exactly at those points Q such that 3- Q = T,T + S, or T — S, and [3] vanishes on 
the hyperplane Y = 0 exactly at those point Q such that3- Q = —T,—T + S, or 
—T-S. 

For every T € E[3], there is a cubic form ar which intersects E, in exactly 
those points Q such that 3- Q = T. By work in [4] we can easily compute such ar, 
well-defined up to (the same) scalar, when T 4 O as T,- Mr -v, where v is the point 
(x; y;z) € P? represented by the column vector (xy z), and 7, is the tangent 
plane to v represented by the row vector (x?—Ayz y?—Axz 2°—Axy). Moreover we 
can take ao to be the Hessian cubic taking v to —2A?(x? + y? +23) + (8—2A3)xyz. 
Then the composite birational map [3] : P? > P?/E[n] = P? is given by 


[3]: x  ((a@rar+sar—s)(X); (€-ra_r+sa—r—s)(X); (€odsa_s)(x)). 


We know that these functions can be written as functions on S, i.e., as functions of 
S,t, u,v, and V as above, and indeed we have the following: 


(arar+sar—s)(x) = (1—A°)(0— 357) 
(a_pa_r4sa_r_s)(x) = (1—A3)(v — 357) (5.4) 
(aoasa_s)(x) = (t? — 3u)(—2A2t + (8 — 2A3)s). 


We can substitute 1 = +, and the map simplifies to 


uke 
35 
[3] : (x, y,z) & (0 — 387; v— 38°; 85(t? — 3u)). 


This map factors through T3 C Proj(K[s, t, u,v, ¥]/(fi, f2)), namely by sending 
a point (s;t;u;v;P) to (¥ — 357; v — 353; 8s(t? — 3u)). Since all the scheme maps 
in Theorem 5.8 in the case n = 3 are birational, this gives us a birational model 
of our sampling space; indeed we will now compute the inverse map. Starting with 
(a;b;c) € P?, we will compute its inverse image in T3. First, over an open part 
of our surface we may assume s = 1, in which case we can recover v = b + 3 
and v = c + 3. Moreover, since both (a; b; c) and its preimage (x; y; z) lie on the 


@tb+c3 _ tt _ 
3abc ~ 35 t/3. 


same curve £,, we know that A = t(a,b,c)/3s(a,b,c) = 


3 3 ee 
In other words, tf = ath tc 


1/3(2 ee y 

abc 

Next we will explicitly compute the tautological genus-one curve lying over T3. 
To do so, start with a rational point P € T3(K) and a lift (x; y;z) in P?(K). The 
matrix Mp which takes f : E — FP? to the fiber above P, fp : C — S, satisfies 
the following property: for any o € Gx, 


. Finally, we can solve for u to get u = —a/24 + 


M> = Mro)Mp, <=> y = Mro)|y]; 
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and any such matrix M € PGL, which satisfies that property will bring f : E > P? 
to some diagram equivalent to fp : C — S. With that in mind, we will define 


x 0 0 
Mp=|0 y 0 |Q?l4+y3M7!4+2Mz). 
0 0 z 


A calculation shows that it satisfies the above property. To find the tautological curve 
over T3 it remains to act by Mp, that is, to expand Fc(x) = F(Mp(x)), where 
F(X, Y,Z) = X3+ Y?4+ Z3—3\ XYZ is the cubic giving E. By construction the 
coefficients will be functions on T3. A calculation gives us: 


Fo(X,Y, Z) = (t* — 4ut? + 2u? + 387t)X? + (vt — w?)¥? + (ot —u?)Z? 
40 =i Os XCY & Ost yor’ 2 + Gr — tt = 65°) 
xZX 4+ Oy = 3e 4 3x2 + Oi =u = Or OY? xX 
Gr h=vZ2-Y + Or aT XYZ. 


Corollary 5.12. The generic level-three trivial norm symbol is given by a rational 
map P?/E(3] = Ts; > G2, which sends the image of x € P? to (as(x)?, ar(x)°), 
where S and T generate E{n]. Explicitly, it sends (s;t;u;v;V) to 


(307v + 3¢0 + 23 — Qut —v—P+t 65°, v + 3su + 357t + 65°). 


Proof. The function ar can be computed as ar = T,- Mr -v. For anyo € Gx, 
we then have a7 = (T,)° - M?-v° = Ty - Mr - v’, since the Gx action on E[n] 
and thus M7 is trivial, and since the entries of T,, are polynomials in the coordinates 
of v. By construction, v? = M7y,,)v, and a calculation yields Tyo = T, Mr): so 
ay = [Mr, Mri] - Ty: Mr - v. By work in [10], [Mr, Mr(.)] = e(T, T(0)). oO 


What if we assume a slightly less stringent condition, namely that E[n] = 
Z/nZ xX [t,? By previous observations, we can “base change” to K(¢,,) and use 
the corestriction map to get a sampling space over K. However, the dimension gets 
multiplied by the degree of the field extension K(¢,,)/K, which divides @(n). 

In fact, when E[n] =~ Z/nZ x py, then Hp (E[n]) = Hy(Z/nZ) x Ap (Un): 
the arithmetic dimension of H ree Ln) is of course 1 when K is a number field (since 
its set of K points is infinite but a sampling space is G,,,). However, the arithmetic 
dimension of Ag (Z/nZ,) is bounded by the essential dimension of Z/nZ, which is 
expected to be unbounded (and related to #(7)). Note that this is equivalent to asking 
for the dimension of a versal family of Z/nZ-extensions of Q, a classical problem 
from inverse Galois theory. In other words, the functor Hg», which is carved out of 
the functor H'(E[n]), seems to get more complicated as the Galois action on E[n] 
becomes more complicated. The conjecture says that in spite of this, Ho, becomes 
no more complicated than growing linearly with n. 
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6 Further remarks 


¢ TIwould like to compare arithmetic dimensions over number fields with those over 
p-adic fields and finite fields. Since many schemes automatically have points 
over such fields, the arithmetic dimensions of functors such as H 7(G) over such 
fields should generally speaking be smaller. Of course I would like to make this 
precise. Certainly such things as the (functor associated to the) Brauer group over 
such fields is understood to be smaller. 

e« A major motivation for the definitions in this paper is to explore the question 
of finding a sampling space for the Tate-Shavarevich group of a fixed elliptic 
curve F over a number field K (suitably extended to a functor on the category 
Cx) or prove that no such sampling space exists. Note we have some freedom 
in extending the functor to the full category Cx. However, the most obvious 
approach would have X € Cx map to Ker(H}(X, E) > |[ex He @K 
K,, G). In this case, we would have a tautological genus-one curve lying over 
any sampling space. It is not hard to see that this would imply that there is a 
(uniform) bound for the order of elements of the Tate-Shafarevich group of EF 
over all finite extensions of K, a very strong result. However, there may be more 
nuanced ways to extend the functor. What is probably an easier task is to find a 
series of sampling spaces for the p-power torsion of the Tate—Shavarevich group, 
for all primes p. 

In an analogy for Hilbert symbols, if the ground field K is large enough to 
admit rational p” torsion for all (in which case K is not a number field), 
then an element of III(£) would correspond to a coherent sequence of trivial 
Hilbert symbols of level p” for all n. The above assumption on the ground field 
is Iwasawa-like; if possible we will work without such an assumption, so as to 
stay over number fields. 

¢ One major goal is to not only have a sampling space for a functor but to have a 
nice sampling space, namely one that is as close as possible to A”. As we have 
seen, this is often attainable. We want nice sampling spaces primarily because 
then the points are extremely accessible (i.e., so we really have an analogy with 
the concept of a “parameter space’’), but also because we have a well-defined 
notion of height of a point. This could be useful in quantifying the arithmetic 
objects that we are parameterizing. Often we have a different notion of height 
on the objects already, for example by looking at the size of the coefficients of 
the objects. We would want to prove “comparison theorems” in order to justify 
the height measurements coming from our sampling space. An example of a 
comparison theorem would state that if the maximal height of a coefficient of a 
model of a cubic equation C is D, then there is a point of the height at most 
f(D) on the sampling space of cubics which maps to C, where f(D) is a 
simple function of D. With the aid of a comparison theorem we would be able to 
compute the asymptotic behavior of our arithmetic objects. 
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Another benefit of having a good sampling space for objects, especially with 

an accompanying explicit tautological model of the objects, is simply that we can 
test for, say, local points on the models. 
There are numerous simple questions one can ask about sampling spaces which 
give rise to interesting and basic questions about elliptic curves and the arithmetic 
of number fields. For example, we know that the arithmetic dimension of Hj (An) 
is | when K is a number field, but is G,, (with the standard map) the only 
sampling space? Is there an elliptic curve, an open part of which acts (with an 
appropriate map) as a sampling space? First note that any sampling space would 
have to map (rationally) through G,,, with the standard map after we adjoin the 
nth roots of unity, since Kummer theory will supply us with an element in the 
function field of the sampling space. Then (an open part of) an elliptic curve 
sampling space would map through G,,,; clearly it would not be possible for the 
map to be a homomorphism of the group schemes, since the Mordell—Weil group 
is finitely generated. However, it is possible it could be some other map, like 
sending a point (x, y) € E(K) to say y € G,,(K). This raises the question of 
“growth” of rational points as K grows: the functor Hy ([n)(K) = K*/K*" 
enlarges with any non-trivial extension L/ K, and so would any sampling space. 
Therefore we couldn’t possibly have an elliptic curve sampling space unless 
we found an elliptic curve whose rank grew with every extension. This is very 
unlikely but I don’t know a proof that no such elliptic curve exists except over 
Q. Note finally that by Faltings’ theorem no genus-2 (or higher) curve could be a 
sampling space for Hj-({1,); another related question would be, is there another 
map (whose degree is larger than 1) making G,, a sampling space in a different 
way? 
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Abstract We develop the global theory of a strategy to tackle a program initiated 
by Bogomolov in 1990. That program aims at giving a group-theoretical recipe by 
which one can reconstruct function fields K|k with td(K|k) > 1 and k algebraically 
closed from the maximal pro-¢ abelian-by-central Galois group 7; of K, where ¢ 
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1 Introduction 


Recall that the birational anabelian conjecture originating in ideas presented in 
Grothendieck’s Esquisse d’un Programme [11] and Letter to Faltings [12] asserts 
roughly the following: First, there should exist a group-theoretical recipe by which 
one can recognize the absolute Galois groups Gx of finitely generated infinite fields 
K among all profinite groups. Second, if G = Gx is such an absolute Galois group, 
then the group-theoretical recipe should recover the field K from Gx in a functorial 
way. Third, the recipe should be invariant under open homomorphisms of absolute 


F. Pop (24) 
Dept of Mathematics, University of Pennsylvania, 209 S 33rd St, Philadelphia, PA 19104, USA 
e-mail: pop @math.upenn.edu 


D. Goldfeld et al. (eds.), Number Theory, Analysis and Geometry: 519 
In Memory of Serge Lang, DOI 10.1007/978-1-4614-1260-1_24, 
© Springer Science+Business Media, LLC 2012 


520 F. Pop 


Galois groups. In particular, the category of finitely generated infinite fields (up 
to Frobenius twist) should be equivalent to the category of their absolute Galois 
groups and open outer homomorphisms between these groups. A first instance of 
this situation is the celebrated Neukirch—Uchida theorem, which says that global 
fields are characterized by their absolute Galois groups. I will not go into further 
detail about the results concerning Grothendieck’s (birational) anabelian geometry, 
but the interested reader can find more about this in Szamuely’s Bourbaki Séminaire 
talk [34], Faltings’ Séminaire Bourbaki talk [10], Stix [35], and newer results by 
Mochizuki [19], Saidi-Tamagawa [33], Minhyong Kim [13], and Koenigsmann [15] 
concerning the (birational) section conjecture. 

The idea behind Grothendieck’s anabelian geometry is that the arithmetical 
Galois action on rich geometric fundamental groups (such as the geometric absolute 
Galois group) makes objects very rigid, so that there is no room left for non- 
geometric open morphisms between such rich fundamental groups endowed with 
arithmetical Galois action. 

On the other hand, Bogomolov [2] advanced at the beginning of the 1990s the 
idea that one should have anabelian-type results in total absence of an arithmetical 
action as follows: Let ¢ be a fixed rational prime number. Consider function 
fields K|k over algebraically closed fields k of characteristic # £. For each such 
function field K|k, let IT; := Gal(K"|K) be the Galois group of a maximal pro-€ 
abelian-by-central Galois extension K”|K. Note that if G) = Gx and G“t) := 
[G, Gx\(G)© fori > 1 are the central £° terms of the absolute Galois group 
Gx of K, then we have that Tx = G“)/G™ is the Galois group of the maximal 
pro-€ abelian subextension K’|K of K”|K, and 7; = G/G®); and denoting 
by G‘©) the intersection of all the G™, it follows that Gx(£) := Gx/G©) is the 
maximal pro- quotient of Gx. Now the program initiated by Bogomolov [2] has as 
ultimate goal to recover function fields K|k with td(K|k) > 1 as above from IT; 
in a functorial way. (Note that Bogomolov denotes [7% by PGal.) If successful, 
this program would go far beyond Grothendieck’s birational anabelian conjectures, 
as k being algebraically closed implies that there is no arithmetical action in the 
game. The program initiated by Bogomolov is not completed yet, and this paper is a 
contribution towards trying to settle that program; see the historical note below for 
more about this. 

Since this paper is quite abstract, let me announce the following “concrete” result, 
whose proof relies in an essential way on the Main Theorem of this paper (see 
Pop [30] for a complete proof): 

Theorem I Lert K\k be a function field with td(K |k) > 1 and k an algebraic closure 
of a finite field. Then the following hold: 


(1) There exists a group-theoretical recipe which recovers K|k from Tj. 

(2) The above group-theoretical recipe is functorial in the following sense: Let L|l 
be a function field with | an algebraically closed field, and let ® : Tx — IT, 
be the abelianization of some isomorphism ®° : IT; — IT;. Then denoting by 
Li and K' the perfect closures, there exist an isomorphism of field extensions 
1: Lill — K'\k and an €-adic unit ¢ € Z¥ such that ¢ - ® is induced by 1. 
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Moreover, the isomorphism 1 is unique up to Frobenius twists, and the ¢-adic 
unit € is unique up to multiplication by p-powers, where p = char(k). 

(3) Fora function field L\l as above, let Isom" (L, K) be the set of isomorphisms of 
field extensions 1: L'|l — K'|k up to Frobenius twists, and let Isom‘ (ITx, IT) 
be the set of abelianizations of continuous group isomorphisms IT; — IT; up 
to multiplication by t-adic units ¢ € Z;. Then there is a canonical bijection 


Isom'(L|/, K|k) — Isom®(ITx, IT). 


A sketch of a strategy to functorially recover K|k from pro-€ Galois information, 
in particular to prove the above Theorem I, can be found essentially already in (the 
notes of) Pop [25], and has as starting point the following simple idea: Let K be 
the £-adic completion of the multiplicative group K* of K|k.! Since the cyclotomic 
character of K is trivial, one can identify the €-adic Tate module Tx,¢ of K with Z, 
(non-canonically), and let 1x : Tx. — Ze be a fixed identification. Via Kummer 
theory, one has isomorphisms of £-adically complete groups: 


K = Homeont 7x, Te) “4 Homeont (Tx, Ze), 


i.e., K can be recovered from /7x, hence from /7;, via the projection IT; — ITx. 


On the other hand, since k” is divisible, K equals the €-adic completion of the free 
abelian group K*/k*. Now the idea of recovering K |k is as follows: 


(a) First, give a recipe to recover the image jx(K*%) = K%*/k”* of the 
é-adic completion functor jx : K* > K*/k* ¢ K inside the “known” 
K = Homeom UT, Zi). 
(b) Second, interpreting K*/k* =: A(K) as the projectivization of the infinite- 
dimensional k-vector space (K,+), give a recipe to recover the projective 
lines [.. := (kx + ky)’*/k™ inside A(K), where x, y € K are k-linearly 
independent. 
Third, apply the fundamental theorem of projective geometries of Artin [1], and 
deduce that K|k can be recovered from (XK) endowed with all the lines I, ,. 
Finally, show that the recipes above are functorial, i.e., they are invariant under 
isomorphisms of profinite groups 17x — JI, which are abelianizations of 
isomorphisms [7; -—> I/;. In particular, such isomorphisms [7x —> ITz 
originate actually from geometry. 


(c 


wa 


(d 


wa 


The strategy from Pop [25] to tackle the above problems (a), (b), (c), (d), above 
is in principle similar to the strategies (initiated by Neukirch and Uchida) for 
tackling Grothendieck’s anabelian conjectures. It has two main parts, as follows, 
the terminology being as introduced later: 


‘Recall that for an abelian group A, its €-adic completion is by definition A:= lim A/€°. 
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Local theory: It has as input the Galois/group-theoretical information J7;. It 
should be a recipe which in a first approximation recovers from J7; the decom- 
position/inertia groups of prime divisors of K|k in [Tx (N.B., not in [7;). The final 
output of the local theory should be the total decomposition graph Gener of Kk. 
This recipe should be invariant under isomorphisms J7x — IT, which are induced 
by some isomorphisms [7; — IT;. 


Global theory: Its input is the total decomposition graph Gags of K|k. It should be 
a recipe which in a first approximation recovers the geometric decomposition graphs 
Gm, (together with some of their special properties) for K|k from Gegor together with 
their sets of rational quotients Ax = {®,,},,. Ina second approximation, this recipe 
should recover A(K) and its projective lines from the %, endowed with their 
rational quotients Ax = {®,,},,. It thus should finally recover the function field 
K|k. Moreover, this recipe should be functorial, i.e., invariant under isomorphisms 
of total decomposition graphs Yo > Yq. 

This paper deals mainly with questions of the above global theory, precisely, 
recovering the geometric decomposition graphs (together with some of their special 
properties) from the total decomposition graph, and finally proving the main result 
of the paper, which is to show that morphisms of (total) decomposition graphs that 
are compatible with rational projections originate in a precise way from geometry. 

Before announcing the main result here, let us briefly introduce the main concepts 
and objects, which will be discussed/studied in detail later. 


e Prime divisor graphs (see Section 3 for more details) 


Recall that in the context above, a (Zariski) prime divisor of a function field K|k 
is a discrete valuation v of K whose valuation ring is the local ring @y x, of the 
generic point x; of some Weil prime divisor of some normal model X — k of K|k. 
If so, then the residue field Kv of v is the function field Kv = «(x;), and therefore, 
td(Ky|k) = td(K|k) — 1. A set of Zariski prime divisors D of K|k is called a 
geometric set if there exists a quasiprojective normal model X¥ — k of K|k such 
that D = Dy is the set of valuations v,, defined by the generic points x of all the 
Weil prime divisors of X. We next generalize the prime divisors of K|k as follows: 
First, for a valuation v of K the following are equivalent: 


(i) 0 is trivial on k, the residue field has td(Kv|k) = td(K|k) — r, and there exists 
a chain of valuations b} <--- < v0, := 0. 

(ii) v is the valuation-theoretical composition b = v, o-+-o v;, where v; is a prime 
divisor of K, and inductively, v;+; is a prime divisor of the residue function 
field k(v;)|k. 


A valuation v of K which satisfies the above equivalent conditions is called a 
prime r-divisor of K|k; and a sequence of prime divisors (v,,...,V1) as above 
will be called a Parshin r-chain of K|k. By definition, the trivial valuation will be 
considered a generalized prime divisor of rank zero, and the corresponding Parshin 
chain is the trivial Parshin chain. Finally, note that r < td(K|k), and that in the 
above notation, one has b; = v; 0---o vy; foralli > 1. 
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The total prime divisor graph A" of K is the following half-oriented graph: 


(a) the vertices of Z are the residue fields Kv of all generalized prime divisors » 
of K|k viewed as distinct function fields. 

(b) For given b = v, 0-+-o vy; and tv = w, 0-:-o wy, the edges from Kb to Kto are 
as follows: 


(i) Ifo = w,i.e., Ko = Ktv, then the trivial valuation v/t~ = tv/v is the only 
edge from Kv = Kt to itself; and it is by definition a non-oriented edge. 

(ii) If Ko # Kv, then the set of edges from Kv to Ktw is non-empty iff s = 
r+1landv; = w; for1 <i <r; andif so, then w, = tv/v is the only edge 
from Kv to Ktv, and it is by definition an oriented edge. 


A geometric prime divisor graph for K|k is any connected subgraph Dx of G 
which satisfies the following conditions: First, for each vertex Kv of Y, the set 
Dy, of all non-trivial edges of Y, originating from Kv is a geometric set of prime 
divisors of Kv|k. Second, all maximal branches of non-trivial edges of Y originate 
at K and have length equal to td(K |k). Equivalently, Y is a half-oriented connected 
graph having K = Ko as origin and satisfying: 


(a) the vertices of Ye are distinct function fields K;|k over k; 

(b) for every vertex K;, the trivial valuation of K; is the only edge from K; to 
itself. And the set of non-trivial edges v; originating at K;* is a geometric set 
of prime divisors of K;*«|k, and if v; is a non-trivial edge from K;* to K;, then 
K; = Kj*v;; 

(c) the only cycles of the graph are the non-oriented edges, and all the maximal 
branches consisting of oriented edges only have length equal to td(K |k). 


The functorial behavior of geometric prime divisor graphs is as follows: 


(1) Embeddings. Let L|I — K\k be an embedding of function fields which 
maps / onto k. Then the canonical restriction map of valuations Valk — Valz, 
v '& viz, gives rise to a morphism of the total prime divisor graphs 
g, : ZEe' — GS, which moreover is surjective. The relation between geometric 
prime divisor graphs Dx and Y, is a little bit more subtle; see Proposition 37: 
Given geometric prime divisor graphs YZ and Y, there exist geometric prime 
divisor graphs QR and oN containing ZY, respectively Y, such that y, defines 
a surjective morphism of geometric prime divisor graphs: 


0: Be > @. 


(2) Restrictions. Given a generalized prime divisor » of K|k, let F" be the set of 
all generalized prime divisors tv of K|k with v < tv. Then the map 


De > FS, wor w/b, 
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is an isomorphism of ho onto ia Moreover, if Kb is a vertex of some 
geometric prime divisor graph Yr for K|k, then one has that the maximal 
subgraph Dey of De whose initial vertex is Kv is a geometric graph of prime 
divisors of Kv. 


e Decomposition graphs (see Section 3 for more details) 


Let K|k be as considered above. Then we have the following, see e.g., Pop [28], 
Introduction, for a discussion of these facts: For every prime divisor v of K|k one 
has T, = Ty¢.x, and for every prime r-divisor » one has T, & ThK: Further, for 
generalized prime divisors » and tv one has Z, 1 Zw A 1 if and only if bv, 
are not independent as valuations, i.e, @ := Oy Ow #~ K; and if so, then @ is 
the valuation ring of a generalized prime divisor u of K|k which turns out to be 
the unique generalized prime divisor with T, = 7, M Typ, and also the unique 
generalized prime divisor of K|k maximal with the property Zy, Zp C Zu. 

In particular, » = tw iff Ty, = Ty iff Zp = Zp. Further, v < w iff T, C Ty 
strictly iff Zp D Zp strictly, and Tiy)/T) = Zi" if v is a prime r-divisor and tv is 
a prime s-divisor. 

We conclude that the partial ordering of the set of all generalized prime divisors 
v of K\k is encoded in the set of their inertia/decomposition groups T, C Dy. 
In particular, the existence of the trivial, respectively a non-trivial, edge from Kv to 
Kw in A" is equivalent to T, = Ty, respectively to Ty C Ty and T/T, & Ze. 

Via the Galois correspondence and the functorial properties of the Hilbert 
decomposition theory for valuations, we attach to the total prime divisor graph A 
of K|k a graph Gogo whose vertices and edges are in bijection with those of Z as 
follows: 


(a) The vertices of Gio are ITxy, viewed as distinct pro-£ groups (all v). 

(b) If the edge from Kv to Ktv exists, the corresponding edge from ITxy to Tx 
is endowed with the pair of groups Tip [ Zyp/p viewed as subgroups of ITxy; 
thus Tk» = Zro/o / Tyw/v- 


The graph Gogo will be called the total decomposition graph of K|k, or of Ix. 
If FRx. © Gs is a geometric graph of prime divisors of K|k, the corresponding 
subgraph G, © Gogo will be called a geometric decomposition graph for K|k, or 
for ITx. 

Next recall that the isomorphy type of (the maximal abelian pro-€ quotient of) 
the fundamental group I7;(X) := 2} it (X) of complete regular models X¥ — k, 
if such models exist, depends on K|k only, and not on X — k. Moreover, one 
can recover JT\(X) as being [7;(X) = ITx/Tx, where Tx is the subgroup of Gx 
generated by all the inertia groups 7, with v a prime divisor of K|k. This justifies 
calling the group [7.x := [x /Tx the birational fundamental group for K|k. As 
discussed at Fact 57, there always exist quasiprojective normal models X — k for 
K|k such that Tx = Tp,, where Tp, is the closed subgroup of [Tx generated by 
all T, with v € Dy. We will say that a model X — k of K|k and/or that Dy is 
complete regular-like if Tx = Tpy and the rational rank rr(€{(X)) of the divisor 
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class group €[(X) is positive, and for every normal quasiprojective model X with 
Dy © Dg one has that rr(€((X)) = rr(€U(X)) + |Dg\Dy|. Note that a complete 
regular like curve is a complete normal curve and viceversa. We say that a geometric 
decomposition graph %, is complete regular-like if, for all vertices v of Zr with 
td(Kv|k) > 0, one has that the set Dy of 1-edges of %,.,,, is complete regular-like. 

As shown in Proposition 22, there exists a group-theoretical recipe by which one 
can recover the geometric decomposition graphs (and the property of being complete 
regular-like) from the total decomposition graph Geogr. Further, by Proposition 39, 
that recipe is invariant under isomorphisms @ : Gq > Gog, ie., every such 
isomorphism gives rise by restriction to isomorphisms of the (complete regular-like) 
decomposition graphs for K|k onto the (complete regular-like) ones for L|I. 

The functorial properties of the graphs of prime divisors translate to the following 
functorial properties of the decomposition graphs: 


(1) Embeddings. Let1 : L|l — K|k be an embedding of function fields which 
maps / onto k. Then the canonical projection homomorphism ®, : Tx —> [Tz 
is an open homomorphism, and for every generalized prime divisor v of K|k 
and its restriction t;, to L, one has that ®,(Z») C Zp, is an open subgroup, and 
®,(T») S To, satisfies ®,(T,) = 1 iff vy is the trivial valuation. Therefore, ®, 
gives rise to a morphism of total decomposition graphs, which we denote by the 
same symbol 


@, : Gogo > Gop. 


In turn, for given geometric decomposition graphs Z and Y,, for which z gives 
rise to a morphism of geometric decomposition graphs Fy — Y, the above @, 
morphism of total decomposition graphs gives rise to a morphisms of geometric 
decomposition graphs ®, : %, — %@,, as defined later in Sections 4 and 5. 


(2) Restrictions. Given a generalized prime divisor v of Kk, let pr, : Zp > ko 
be the canonical projection. Then for eveary ty > v we have that Ty) C Zp are 
mapped onto Typ C Zp. Therefore, the total decomposition graph of Kv|k 
can be recovered from that of K|k in a canonical way via pr, : Zp > ITxp. 


e Rational quotients (see Section 5 for more details). Let K|k be a function field 
as above satisfying td(K|k) > 1. For every non-constant function t € K, let x, be 
the relative algebraic closure of k(t) in K. Since td(k;|k) = 1, it follows that K; 
has a unique complete normal model X; — k, which is a projective smooth curve. 
Therefore, the set of prime divisors of x;|k is actually in bijection with the (local 
rings at the) closed points of X;, thus with the set of Weil prime divisors of X;. 
Therefore, the total prime divisor graph ae for x;|& is actually the unique maximal 
geometric prime divisor graph for k;|k. We denote a simply by %,. 

Let 1; : k; — K be the canonical embedding, and ®, : x — I,, the 
(surjective) canonical projection. Then by the functoriality of embeddings, ®,, 
gives rise canonically to a morphism ®, : Gi — G,. Moreover, if %, is 
a geometric decomposition graph for K|k, then ®,, restricts to a morphism of 
geometric decomposition graphs ®,, : Ga, > Ge,. 
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In the above context, if x; = k(t), we say that ®,, is a rational quotient of 
Geogr as well as of every geometric decomposition graph %, for K|k. We call such 
t € K “general elements” of K, and usually denote general elements of K by x, 
in order to distinguish them from the “usual” non-constant t €¢ K. A “birational” 
Bertini-type argument shows that there are “many” general elements in K; see 
Lang [18], Ch. VIII, and/or Roquette [30], § 4, respectively Fact 43 in Section 5: 
For any given algebraically independent functions ¢, t’ € K, not both inseparable, 
tata 2= t/(a't' + a) is a general element of K for almost all a’,a € k. A set of 
general elements &' C K is a Bertini set if Y’ contains almost all elements tg’, for 
all t,t’ as above. We denote by Ax = {®,,},, the set of all the rational quotients 
of K|k, and consider subsets 21 C 21x containing all the ®,. € 2, x € SX’, with Y 
some Bertini set of general elements, and call them, for short, Bertini-type sets of 
rational quotients. 

The relation between rational projections and morphisms of geometric de- 
composition graphs is as follows: Let 1 : Lil <— K\|k be an embedding of 
function fields with 1(7) = k, such that K|:(L) a separable field extension, and 
td(L|/) > 1. Then there exists a Bertini-type set 8 = {®,,},, for L|/ such that 
Kx := 1(ky) is relatively algebraically closed in K for all ky. Hence for all ®,, € B 
and the corresponding ®. € 2K, Ky := 1(Ky), we get that the isomorphism 
Prey: Ge. > Gy defined by tx, °= tk, satisfies the condition 


Py, ° ®, = Px. e Px. 


Because of this property, we will say that ®, is compatible with rational quotients. 
e Abstract decomposition graphs 


It is one of our main tasks in the present manuscript to define and study abstract 
decomposition graphs, which resemble the geometric decomposition graphs %, 
(this will be done in Section 2) and to define proper morphisms of such abstract 
decomposition graphs, in particular their rational quotients (which will be done in 
Section 4). The abstract decomposition graphs, which endowed with families of ra- 
tional quotients resemble the complete regular-like geometric decomposition graphs 
as introduced above, will be called complete regular-like abstract decomposition 
graphs. 

The main result of this manuscript is the following; see Theorem 45 for a 
more general assertion, and Definition 21, Fact/Definition 43 (2), Definition 33 


(and Definitions 12 and 9), and Definition/Remark 34 for the definitions of all the 
terms: 


Main Theorem Let K|k and L|I be function fields with td(K |k) > 1. Let Gogo and 
Hoe be their total decomposition graphs, which we endow with Bertini-type sets 
of rational quotients 2, respectively 8. Then the following hold: 


(1) There exists a group-theoretical recipe which recovers K|k from Gogo endowed 
with 2. Moreover, this recipe is invariant under isomorphisms in the following 
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sense: Up to multiplication by t-adic units and composition with automor- 
phisms ®, of Genser defined by automorphisms 1: K'|k — K'|k, there exists 
at most one isomorphism ® : Goo > Ap of abstract decomposition graphs 
which is compatible with the sets of rational quotients A and SB. 

(2) The following more precise assertion holds: Suppose that td(L|1) > 1. Let Go, 
and Hg, be geometric complete regular-like decomposition graphs for K|k, 
which endowed with 2, respectively 8, are viewed as complete regular-like 
abstract decomposition graphs. Then for every morphism 


® : Gy, > Ho, 


which is compatible with the sets of rational quotients A and B, there exist an t-adic 
unit ¢ € ZF and an embedding of field extensions 


mils Rk 


such that ® = ¢- ®,, where ®, : Gy, — Hg, is the canonical morphism defined 
by 1 as above. 

Further, 1(1) = k, and 1 is unique up to Frobenius twists, and € is unique up to 
multiplication by powers of p, where p = char(k). 


We notice that the Main Theorem above (together with Propositions 22 and 39) 
reduces the problem of functorially recovering K|k from /7;., thus completing the 
proof of the above Theorem I, to recovering the total decomposition graph Ga 
of K|k and its rational quotients. In the case that k is an algebraic closure of a 
finite field, both these problems were solved in Pop [27], but working with the full 
pro-¢ Galois group Gx (£) instead of J7;. Nevertheless, the methods of Pop [27] to 
recover Gi and its rational quotients used only the set of all divisorial groups 
T, C Z, inside ITx. Using the local theory developed in Pop [28] instead of 
the local theory of Pop [27], a complete proof of Theorem I above is given in 
Pop [30]. 


Historical note 


The idea to recover K |k from JT; originates from Bogomolov [2], and a first attempt 
to do so can be found in his fundamental paper [2]. Although that paper is too 
sketchy to make clear what the author precisely proposes, a thorough inspection 
shows that it provides a fundamental tool for recovering inertia elements of 
valuations v of K (which nevertheless may be non-trivial on k). This is Bogomolov’s 
theory of commuting liftable pairs; see Bogomolov—Tschinkel [3] for detailed 
proofs. On the other hand, it is not at all clear how and whether one could develop 
a “global theory” along the lines (vaguely) suggested in [2], and there was virtually 
no progress on the problem for about a decade. 

A sketch of a viable global theory —at least in the case that k is an algebraic 
closure of a finite field— was proposed in the notes of my MSRI talk in the fall of 
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1999; see Pop [25]. In the second part of Pop [26], the technical details concerning 
the global theory hinted at in Pop [25] were worked out. Actually, this paper is 
an elaboration of parts of Pop [26], and the main theorem here, more precisely 
Theorem 45, is the Hom-form of the Isom-form of Theorem 5.11 of [26]. However, 
I should mention that in [26] the mixed “arithmetic + geometric situation” was 
considered as well as non-abelian Galois groups, which is of interest in the case 
that k is not algebraically closed. 

In the case that k is an algebraic closure of a finite field, \et me finally mention: 


¢ In the paper Pop [27], a recipe to functorially recover K|k from Gx(¢), in par- 
ticular a proof of (a slightly stronger form of) the above target result was given. 
First, the assertion one proves using Gx (€) instead of /7;. is stronger, namely, if 
® : IIx — I, is the abelianization of an isomorphism ®(£) : Gx (£) > Gz(£), 
then there exists an isomorphism 1 : L'|/ — K'|k (unique up to Frobenius 
twists) which defines ®; thus one does not need to “adjust” ® by multiplying 
by an ¢-adic unit e € Z/.I should also observe that the full Gx(€) was used 
in Pop [27] essentially only in order to recover the divisorial subgroups of ITx 
via the canonical projection Gx(€) — ITx, whereas all the other steps of the 
local and global theory are virtually identical with the ones in the case of IT;. 
(The recipe to recover the divisorial subgroups of [Tx via IT; — ITx is given in 
Pop [28] and uses Bogomolov’s theory of commuting liftable pairs as a “black 
box.” That recipe is used in Pop [30].) 

* Bogomolov—Tschinkel [4], [5], consider the case K = k(X), where X is a 
projective smooth surface over k. In the initial variant of their manuscript [4], 
they considered only the case that zr, (X) is finite, and proved that if 7; and 7; 
are isomorphic, then K|k and L|/ are isomorphic up to pure inseparable closures, 
provided k and / are algebraic closures of finite fields with chat # 2 (which is 
less precise than what the above target result gives in this case). Nevertheless, 
in the published version [5] of their earlier manuscript [4], they announce their 
main result for surfaces in a form almost identical with the target result above 
and use a strategy of proof which is in many ways very similar to that announced 
in Pop [25], and used in Pop [27]. 
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2 Pro-£ abstract decomposition graphs 


In this section we develop an abelian pro-£ prime divisor decomposition theory 
for “abstract function fields” which is similar in some sense to the abstract class 
field theory. Throughout @ is a fixed prime number, and 5 > 0 is a non-negative 
integer. 


2.1 Axioms and definitions 


Definition 1. A Jevel-5 (pro-£) abstract decomposition graph is a connected half- 
oriented graph Y whose vertices are endowed with pro-¢ abelian groups G; and 
whose edges v; are endowed with pairs of pro-¢ abelian groups 7,, C Z,, satisfying 
the following: 

Axiom (I): The vertices of Y are pro- abelian free groups G;, and Y has an origin, 
which we denote by Go = G. 

Axiom (II): The edges v; and the corresponding 7,, C Z,, satisfy the following: 


(i) For every vertex G; there exists a unique non-oriented edge vio from G; to 
itself, and the corresponding pair of pro-¢ groups is {1} =: T,,, C Zy,) = Gi. 
For all other vertices G;* #4 G; there exists at most one edge v; from G;« to 
G;. If v; exists, we say that v; is the oriented edge from G;« to G;, and v; is 
endowed with a pair T,, C Z,, of subgroups of G;* such that T,, = Ze and 
Gj = Zy;,/Ty;- 

The edges of Y are also called valuations of Y; in particular, the edges 
originating from G; are called valuations of G;. The non-oriented edge vjo 
from G; to itself is called the trivial valuation of G;, whereas the oriented 
edges v; originating from G;« are called non-trivial valuations of G;*. 

The groups 7,, © Z,, are called the inertia, respectively decomposition, 
groups of v;; and G; := Z,,/T,, is called the residue group of v;. 

(ii) For distinct non-trivial edges v; # v, originating from G;+*, one has 
Zy, 1 Zy, = 1, hence T,, 1 T,,, = 1 holds as well. 

For every cofinite subset {; of the set of non-trivial edges v; originating 
from G;+, let Ty, be the closed subgroup of G;+ generated by all the T,,, v; € 
U;. A system (Ll; 9) of such cofinite subsets is called cofinal, if every finite set 
of valuations v; as above is contained in the complement of L;. for some a. 

(iii) There exist cofinal systems (L;,¢)q such that 7T,, Ty,, = 1 for all a and 
all Vi ¢ Lia: 


Axiom (III): The non-oriented edges vj are the only cycles of the graph Y, and all 
maximal branches of non-trivial edges of Y have length equal to 6. 


Definition/Remark 2. Let ¥ be an abstract decomposition graph of level-dg on 
a pro-€ group G = Go. We will say that Y is a level-dy abstract decomposition 
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graph on G. A valuation of G = Go will be called a 1-edge of Y. If no confusion is 
possible, we will denote the 1-edges of Y simply by v; thus the corresponding pro-¢ 
groups involved are denoted by 7, € Z, and G, := Z,/T,. 


(1) Consider any 6 such that 0 < 6 < dg. By induction on 6 it is easy to see that 
has a unique maximal connected abstract decomposition subgraph containing 
the origin G of Y and having all branches of oriented edges of length 6. 
(2) Let » = (1,,...,¥,) be a path of length 5, := r of non-trivial valuations 
originating atG = Go. This means by definition that v; is a non-trivial valuation 
of Go, and if r > 1, then for all i < r one has inductively that G; is the residue 
group of v;, and v;+; is a non-trivial valuation of G;. In particular, G, is the 
residue group of v,. Then there exists a unique maximal connected subgraph 
%, of Y having Gy := G, as origin. Clearly, % is in a natural way an abstract 
decomposition graph of level dg — 5 on Gy. 

We say that % is an r-residual abstract decomposition graph of Y. In 
particular, the unique 0-residual abstract decomposition graph of # is ¥ itself. 
For every path » = (v;,...,v1) of length 6, = r as above, we will say that 
Gy, is an r-residual group of Y, precisely that Gy is the v-residual group of G. 
One can further elaborate as follows: For r > 1 we set ~ = (v,~-1,...,V1), 
and suppose that the inertia/decomposition groups Tw © Zr C Go of to have 
been defined inductively such that the residue group Gw := Zyw/Tr of 10 is 
Gy = G,,_,. We then define the inertia/decomposition groups Ty C Zy of b in 
Go as being the preimages of 7, C Z,, C Gy,_, via Zw > Zw/ Tm = Gy,_,. 
Note that by definition we have Z,/T, =: Gy and T, = 7. 

We call » = (v;,...,¥1) a generalized valuation of G = Go, or a multi- 
index of length 6, := r of Y. And we will say that 5, is the rank of v or that v 
is a generalized r-valuation if r = dy. 

Given generalized valuations b = (v,,...,V1), © = (Ws,...,W1), we will 
say that to < vif s <r,andv,; = w; for alli < s. From the definitions one gets 
that if t <b, then Z, C Zy and Ty C Ty. On the other hand, by Axiom II (ii), 
it immediately follows that the converse of (any of) these assertions is also true. 
We will say that » and to are dependent if there exists some g > 0 such that 
vi; = w; fori < q. For dependent generalized valuations b and tv as above, the 
following are equivalent: 


(3 


wm 


(a) g is maximal such that v; = w; fori < q. 

(b) Ty N Tw & Zj. 

(c) g is maximal such that Z,, Zp are both contained in the decomposition 
group of some generalized q-valuation of G = Go. 


(4 


SN 


In order to have a uniform notation, we take v = vo to be the trivial multi-index, 
or the trivial path, of Y as the unique one having length equal to 0. We further 
set Zy, := Go and 7,, = {1}. In particular, one has G,, = Z,,/Ty, = Go, 
which is compatible with the other notations/conventions. Further, vo < v for 
all multi-indices v. 
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Definition/Remark 3. Let Y be a level-dg abstract decomposition graph on the 
abelian pro-£ group G = Go. In notation as above, we consider the following: 


(1) Define Ay := Hom(G, Z.). Since G is a pro-€ free abelian group, Ag isa free 


£-adically complete Z¢-module (in €-adic duality with G). 


From now on suppose that dg > 0. Recall that T, C Z, and G, = Z,/T, denote 
respectively the inertia, the decomposition, and the residue groups at the 1-edges v 
of Y, i.e., at the valuations v of G. 


(2) Denote by T C G the closed subgroup generated by all the inertia groups 7, 


(3 
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(all v as above). We set IT, g := G/T and call it the abstract fundamental group 
of Y. One has a canonical exact sequence 


1o> TS GoelMg ol. 


Taking continuous Z,-Homs, we get an exact sequence of the form 


can ~ 


0-> ee -_ Hom(J7;, G, Ze) — Ag:= Hom(G, 2) = — i a — Hom(T, Z1). 


We will call Ua = Hom(/7j, G, Ze) the unramified part of Ag. And if no 
confusion is possible, we will identify U g With its image in Ag. 

Next we have a closer look at the structure of Ag. For every 1-edge v as above, 
the inclusions 7, —~ Z, © G give rise to restriction homomorphisms as 
follows: 


yo SZ resy 


jy’ : Ag —> Az, := Hom(Z,, Ze) —> Ar, := Hom(7,, Zz). 


(a) We set ia = ker(resz,) and 0, = ker(j”) and call them the principal v- 
units, respectively the v-units, in Ay. And observe that the unramified part 
of Ay is exactly Uwe = N,ker(y’). 

(b) The family (7"), gives rise canonically to a continuous homomorphism 
@, 7” of ¢-adically complete Z, -modules 


®@, 7": Ag > Ar & 6, Ar, 


Thus identifying Ar with its image inside @, Ar, one has 7% = @, 7” 
on Ag. We define Divy = @, Ar, and call it the €-adic abstract divisor 
group of Y. 

(c) Finally, we set gly = coker( 7) and call it the £-adic abstract divisor class 
group of Y. And observe that we have a canonical exact sequence 


Gg 
~ sane J — n~ 
0+Ue — Ay — Dive => Cle > 0. 
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(4) Let Agin := {x € Ay | 7"(x) = 0 foralmostallv}. We notice that 
by Axiom II (iii), the Zp-module Aggin is dense in Ay. Indeed, let (Uy)y be 
a cofinal system of 1-edges v. Then setting G, = G/Ty, and Ty = T/Ty,, we 
have a canonical exact sequence 


1> Ty > Gg > Ig > 1, 


and T, is generated by the images 7, of T, (all v ¢ Ug) in Gy. Clearly, the 
image of the inflation homomorphism inf, : Hom(Gz, Ze) > Hom(G, Ze) is 
exactly 


Ag = {x € Ag | 7° (x) = 0 for all v € Ly } = Nyex, ker(7"). 


Taking inductive limits over the cofinal system (Llq), the density assertion 
follows. 

We observe that 79% (Agtin) & = Ag g/ U g is a Ze-submodule of the Z,-free 
module @, Ar, & = @, Zp; hence oJ 4 (Agin) i is a free oe too. Therefore, 
for every Zy-submodule A C Ay, its image 7%(A) under 7% is a free Zy- 
module. The rank of 7% (A) will be called the corank of A. 

We notice that a Zp-submodule A C Ay has finite corank iff A is contained 
in ker(j”) for almost all v. Clearly, the sum of two finite corank submodules of 
Ag is again of finite corank. Thus the set of such submodules is inductive, and 
one has 


Agtin = U, (all finite corank A) = Ug Ag. 


(5) We say that Y is complete curve-like if the following holds: There exist 
generators t, of 7, such that I], Tt, = 1, and this is the only pro-relation satisfied 
by the system of elements { = (t,),. We call such a system T = (t1,), a 
distinguished system of inertia generators. 

We notice the following: Let Y be complete curve-like, and let T = (t,), 
and ’ = (t/), be distinguished systems of inertia generators. Then t/ = 1" for 
some £-adic units ¢, € Z,, because both t, and tv are generators of T,. Hence 
we have | = [[, t/ = [], t‘’. By the uniqueness of the relation [], t, = 1, it 
follows that ¢, = ¢ for some fixed €-adic unit ¢ € Z,. 

Next consider some 6 with 0 < 6 < dg. We say that Y is level-6 complete 

curve-like if all the (6 — 1)-residual abstract decomposition graphs Y% are 
residually complete curve-like. In particular, “level 1 complete curve-like” is 
the same as “complete curve-like.” 
For every 1|-vertex v consider the exact sequence | > T, ~ Z, > G, > 1 
given by Axiom II (i). Let inf, : Hom(G,, Zz) => Hom(Z,, Zz) be the resulting 
inflation homomorphism. Since T,, = ker(Z, — G,), it follows that resz, (Uy) 
is the image of the inflation map infl,. Therefore there exists a canonical exact 
sequence 


(6 


wm 


0—> o. —s 0, a Hom(G,, Zc) = Ag, > 0, 


and we call 7, the v-reduction homomorphism. 
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(7) 


In particular, if dg > 1, then dg, = dg — 1 > 0 for every 1-vertex v, and we 
have the corresponding exact sequence for the residual abstract decomposition 
graph % Gy 


We will say that Y is ample if 6g > 0 and the following conditions are satisfied: 


(i) Fa : Ag — > @®yex Ar, is surjective for every finite set »’, where 
ft" = Overs”. 
(ii) If dg > 1, then the following hold: 


(a) jg) = Og, and Ue, + Jv(Aggin al U,) a Ag, fin for every v. 
(b) For every finite-corank submodule A C Azw, there exists v such that 
A C U,, and A and 7,(A) have equal coranks. 


Notice that the condition (ii) above is empty in the case dg = 1. Thus if 
dg = 1, then condition (i) is necessary and sufficient for Y to be ample. 

Next consider 0 < 6 < dy. We say that Y is ample up to level 6 if all the 
residual abstract decomposition graphs %, for » such that 0 < 5, < 6 are ample. 
In particular, “ample up to level 1” is the same as “ample.” 


2.2 Abstract Ze) divisor groups 


Definition 4. (1) Let M be the ¢-adic completion of a free Z-module. A Z- 


(2 
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(4) 


(5) 


submodule 4) © M of M is called a Ze)-lattice in M (for short, a lattice) 
if M. is a free Zg)-module, it is ¢-adically dense in M, and it satisfies the 
following equivalent conditions: 


(a) M/E = M/E. 
(b) 4p) has a Zig)-basis S$ which is ¢-adically independent in M. 
(c) Every Ze)-basis of “%, is C-adically independent in M. 


Let N C 4 © M be Zw@-submodules of M such that N and M/N are ¢- 
adically complete and torsion-free. We call 4) an N-lattice in M, if “e)/N 
is a lattice in M/N. 

In the context above, a true lattice in M is a free abelian subgroup ./@ of M 
such that 4c) := M@ ® Zw is a lattice in M in the above sense. And we will 
say that a Z-submodule .#@ C M is a true N-lattice in M if N C -@ and 
M/N is atrue lattice in M/N. 

Let M be an arbitrary Zp-module. We say that subsets M,, M2 of M are €- 
adically equivalent if there exists an f-adic unit e € Ze such that My = e- M, 
inside M. Further, given systems S| = (x;); and Sy = ();); of elements of M, 
we will say that S; and S» are ¢-adically equivalent if there exists an €-adic unit 
€ € Ze such that x; = « y; (alli). 

We define correspondingly the €-adic N -equivalence of N -lattices, etc. 


534 F. Pop 


Construction 5. Let Y be an abstract decomposition graph on G which is level-é 
complete curve-like and ample up to level 6 for some given 6 > 0. Recall the last 
exact sequence from point (4) from Definition/Remark 3: 


& =~ 2 can 74 
6307 = Ae = Ding Ss Oe = 6: 


The aim of this subsection is to describe the £-adic equivalence class of a lattice 
Divg in Diva, in case it exists, which will be called an abstract divisor group of . 
In case the lattice Divg C Divg exists, it satisfies 


Divg ® Ze = @, Ar,. 


Further, the existence (of the equivalence class) of the lattice Divg will turn out to 
be equivalent to the existence (of the equivalence class) of a U g-lattice Ag in Ag, 
which will turn out to be the preimage of Divg in Ag.In particular, if Ag exists, it 
satisfies 

Ag &® Ze = Agin- 


The case 6 = 1,i.e., Y is complete curve-like and ample. 

In the notation from Definition/Remark 3 (5) above, let F = (t,), be a distinguished 
system of inertia generators. Further, let Az be the abelian pro-¢ free group on the 
system & (written multiplicatively). Then one has a canonical exact sequence of 
pro-£ groups 


Ze 


lo > Fe eT Ol, 


where t = [,, 1, in ¥g is the pro-£ product of the generators 1, (all v). Observing 


that Hom(.¥s, Zz) > Dives in a canonical way, and taking €-adically continuous 
Homs, we get an exact sequence 


0> Ar = Hom(T, Ze) > Divy = Hom(¥s, Ze) > Ze = Hom(r™, Ze) > 0, 


where the last homomorphism maps each ¢ to its “trace”: @ b> (x Py, p(t)). 
Thus Ar consists of all the homomorphisms g € Hom(.¥5, Ze) with trivial trace. 
Consider the system SB = (g,), of all the functionals g, € Hom(.¥5, Zz) — 


Divy defined by 9,(t,) = 1 if v = w, and g,(ty) = 0 for all v 4 w. We denote by 
Divs = (B)w@) C Divy 


the Zg-submodule of Hom(.¥z, Zr) = Divy generated by B. Then Divz is a 


lattice in Dive, and 8 is an £-adic basis of Dive. We next set 
Dive := {0 ayy € Divs | 1a, = 0} = Divs N Ar. 


Clearly, Dive is a lattice in Ae And moreover, the system (e, = Qy — Qv wey is 
an ¢-adic Z,¢)-basis of Dive for every fixed v. 
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The dependence of Divs on T = (t,), is as follows. Let T’ = (t/), = LY with 
e € Zi be another distinguished system of inertia generators. If 8’ = (yj), is the 
dual basis to {’, then ¢- 8B’ = B. Thus B and B’ are £-adically equivalent, and we 
have Divs = e€- Dive and Dive =€-: Div’. 

Therefore, all the subgroups of Dive of the form Divg, respectively Dive, are 
¢-adically equivalent (for all distinguished {). Hence the ¢-adic equivalence classes 
of Divs and Dive do not depend on %, but only on Y. 


Fact 6. In the above context, denote by Ag the preimage of Div, thus of Divs, 
in Ag. Consider all the finite-corank submodules A C Aggin with Ug Cc A. Then 
the following hold: 


(i) Ag isa U ¢-lattice in Ag, and Ax C Agim. 
(ii) AN Ag is a Ug-lattice in A (all A as above). 


Moreover, j"(Az) = Ze)Q, (all v). 
Proof. Clear. Oo 
Definition 7. In the context of Fact 6 above, we define objects as follows: 


(1) A lattice of the form Div; C Dive will be called an abstract divisor group 
of Y. We will further say that Dive is the abstract divisor group of degree 0 in 
Divs. 

(2) The Ug-lattice Ag is called a divisorial U g-lattice for Y in Ag. And we will 
say that Az and Divs correspond to each other, and that { defines them. 

e Note that Ag C Agin and Ag ® Z = Agin- Indeed, if x € Aw, then 
J’ (x) = 0 for almost all v, etc. 


The case 6 > 1. 

We begin by mimicking the construction from the case 6 = 1, and then conclude the 
construction by induction on 6. Thus let £ = (1,), be any system of generators for 
the inertia groups 7, (all 1-edges v). Further let ¢ be the abelian pro-¢ free group 
on the system { (written multiplicatively). Then T is a quotient Fz >~ T > 1 
in a canonical way. Observing that Hom(.¥5, Za) = Divy in a canonical way, by 
taking £-adic Homs we get an exact sequence 


0 + Hom(T, Zc) > Hom(¥5, Zr) = Dive. 


Next let 8 = (9,), be the system of all the functionals g, € Hom(.7z, Zz) defined 
by 9,(t) = 1 if v = w, and ¢,(1,) = 0 for all v 4 w. We denote by 


Divs = (B)w C Hom(.¥z, Z:) 


the Z,e)-submodule generated by B. Then 8 is an ¢-adic basis of Hom(.7z, Zc), 


i.e., Dive is €-adically dense in Divg = Hom(.¥¢, Zi), and there are no non-trivial 
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£-adic relations between the elements of 8. We will call 8 = (g,), the “dual basis” 
to ©, and remark that Div is a lattice in Hom(T, Zz). 

Finally, let £’ = (t’), be another system of inertia generators, and suppose that 
Y’ = Y* for some e € Ze. If B’ = (¢g’), is the dual basis to LY’, then eg’ = g, 
inside Hom(T, Ze). Thus ¢ - 8’ = %. In other words, & and 8’ are ¢-adically 
equivalent, and we have Divs = ¢- Diva. 


Fact 8. In the notations from above let a U ,-lattice Ag, Cc Ay, with Oe, C Ag, 
be given for every valuation v of Y. Then the following hold: 


(1) Up to é- adic equivalence, there exists at most one Ug- lattice Ag in Ag such 
that first, Uy Cc Ag C Aggn, and second, for every finite-corank submodule 
A of Agen with Ug C A and the corresponding Ay := j,(AN ee + 
Ue, Cc Ag, in the following hold: 


(i) Aa:=A nN Ag is a U ¢-lattice in A. a 
(ti) Jy(Aa A Uy) + Ug, is a Ug-lattice in A,, which is €-adically Ug,- 
equivalent to Ag, Ay. 


Moreover, if the U g-lattice Ag exists, then its £ -adic equivalence class depends 
only on the £-adic equivalence classes of the U g,-lattices Ag, (all Vv). 
(2 


SYS 


In the above context, suppose that FY is ample, and that the Ug- lattice Ag 
satisfying the conditions (i), (ii), exists. Then Oy, + wlAgn U, ) isa Uy, - 
lattice, which moreover is t-adically Og, -equivalent to Ag, (all v). 


Proof. To (1): Let Ay, Ay be U g-lattices in Ag satisfying the conditions from (1) 
above. Let A € Agiin have finite non-zero corank, and satisfy Oy Cc A. By the 


ampleness of Y, it follows that there exists v such that, first, A C U,, and second, 
Aand A, := 7,(A)+Ug, have equal coranks. Therefore, 7, defines an isomorphism 


of A/Uyg onto A,/U¢@,, and one has 
(*) ker(n)NACUg, p(A)N Gg, S p(Gg). 


For A as above, set A’, = AN Ay. Then by hypothesis (i), it follows that A, and A‘, 
are both U g-lattices in A. Further, by hypothesis (ii), both Ay, := Ue, + p(Aa) 
and A’, := Oe, + Jy (A‘,) are Ug, lattices in A,, which are both equivalent to the 
Og, attics Ag, 1 A,. Therefore, there exists e ¢ ZF such that A’, =e-Ay,. 


Claim. A’, =e- Ag. 


Indeed, A’, = €+ Aa, implies that j,(A‘ ) C e+ (Aa) + Uy. Hence for every 
ee A’, there exist e € Ay and u, € Og, such that j,(e’) = e€s(e) + wy. 
Therefore we have uy = j,(e’ — ee) € 7(A), and hence wu, € 7,(A)N Oy. 
Hence by assertion (*) above, there exists u € U g such that j,(u) = uy; thus 
Jv(u) = jv(e’ — ee). But then we have e’ — (ce + u) € ker(y,) M A, thus 
e’—(ee+u) € Ug by assertion (*). We conclude that e’ € ee+Ug. Sincee’ € A’, 
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was arbitrary, we have A’, C e- Aa + Og. On the other hand, by hypothesis we 
have Oy C Ag and U g C A’,. Hence the above inclusion is actually equivalent 
to A’, C e- Ay. By symmetry, the other inclusion also holds, and we finally get 
A, =E€: Ay. 

We also observe that ¢ is unique up to multiplication by rational ¢-adic units, 
because A’, / Ug =e- A’, / Oy are £-adically equivalent lattices in the non-trivial 
Ze-module A/tyg. Hence recalling that Ag = Uy, Ay and Ay = Uy A%4, and 
taking into account the uniqueness of ¢, one immediately gets that Ay = ¢- Aly, as 
claimed. 

To (2): First, since Ay = Uy, Ay as mentioned above, it follows from 
hypotheses (i), (ii), that Ue, + plAg Nn U,) is €-adically equivalent to some 
U ,-sublattice of Ag,, as this is the case for all the Cg, + p(Aan Us). After 
replacing Ag, by some properly chosen f-adic multiple, say ¢- Ay, with e € Zf, 
without loss of generality, we can suppose that j,(Ag MN U v) © Ag,, and thus 
Oe, + (Ag n U. .) © Ag. For the converse inclusion, let " C Ay, be a finite- 
corank submodule. Then by the ampleness of Y, see Definition/Remark 3 (7) Mi), 
there exists a finite-corank submodule A C Ag such that 7 C U g + WAL nU v). 
But then by properties (i), (Gli), we get TN Ag, © Og, + (Aan UC, = 
Og, + WlAgn Uh Since I" was arbitrary and Ag, = Og, + Ur(U 7 Aw), 
the converse inclusion follows. oO 


Let Y be an abstract decomposition graph which is both level-6 complete curve- 
like and ample up to level 6 for some 6 > 1. In particular, all residual abstract 
decomposition graphs “ to non-trivial indices v of length 6, < 6 are both /evel- 
(6 — dy) complete curve-like and ample up to level (6 — dy); and if 5) = 6 — 1, then 
4, is complete curve-like and ample. Hence if 6, = 6 — 1, then Y% has an abstract 
divisor group Dive, as defined/introduced in Definition 7. In the above context, let 
us fix notation as follows: 


Definition 9. In the above context, we define an abstract divisor group of G (if it 
exists) to be the lattice defined by any system % of inertia generators as above, 


Divy := Divs C Divg, 


which together with its preimage Ag in Ag satisfies inductively on 6 the 
following: 


(i) Abstract divisor groups Divgy, exist for all residual abstract decomposition 
graphs Y,. Let Ag, be the preimage of Dive, in Ag, (all v). 

(ii) Ag satisfies conditions (i), (ii) from Fact 8 for all finite corank submodules 
A c Ag with respect to the preimages Ag, defined at (i) above. 


¢ Note that if Ay exists, then Ay C ee and Ag ® Z; = A gin. Indeed, if 
x € Ag, then 7”(x) = 0 for almost all v, etc. 
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Remarks 10. Let Y be an abstract decomposition graph which is level-6 complete 
curve-like and ample up to level 6 for some 6 > 0. Suppose that an abstract divisor 
group Divg := Divs for Y exists, and let Ag be its preimage in Ag. Then one 
has: 


(1) The homomorphism 7” : Ay = Hom(G, Z,) —> Hom(T,, Ze) = Zc, gives 
rise by restriction to a surjective homomorphism 


y° 2 Ag => Zi) Qy- 


Indeed, by condition (i) of the ampleness, see Definition/Remark 3 (7), it 
follows that 7’(Ay) = Zv@,. Further, since Ay is €-adically dense in Ay, 
it follows that 7”(Ag) is dense in Ze g,. Thus the assertion. 

Moreover, the U @-lattice Ag endowed with all the homomorphisms ;” deter- 
mines Divg as the additive subgroup 


(2 


wa 


Divy = Zoe = Dy J"(Ag) C Divy 


generated by the 7"(Ag) for all the v. Therefore, giving an abstract divisor 
group Divg is equivalent to giving a U g-lattice Ag in Ay such that inductively 
we have: 


(i) Ag satisfies conditions (i), (11) from Fact 8 with respect to the preimages 
Ag, of some abstract divisor groups Diva, (all v). 
Gi) j’(Ag) = Zw (all v), and Ag is the preimage of 6,7"(Ag) via 7" 


(3) Finally, for an abstract divisor group Divg for Y and its preimage Ag in Ag, 
we set Cla, = Divg / 7% (Ag) and call it the abstract ideal class group of Ag. 


Thus one has a commutative diagram of the form 


wm 


bate! 2. can 
C3: Up ey = Biye == cig a 


(*) , tt . J { 


0-> Ug Ay 2 Divy = Fly > 0 


where the first three vertical morphisms are the canonical inclusions, and the 
last one is the £-adic completion homomorphism. 


Proposition 11. Let Y be an abstract decomposition graph which is level-6 com- 
plete curve-like and ample up to level 6 > 0. Then any two abstract divisor groups 
Divg and Divy for Y are ¢-adically equivalent as lattices in Div. Equivalently, 
their preimages Ag and Avy in Ag are £-adically equivalent Ug-lattices in Ag. In 
particular, there exist distinguished systems of inertia generators T and &' defining 
Divy, respectively Div, which are €-adically equivalent, i.e., {' = F for some 
f-adic unit ¢ € ZF. 
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Proof. We prove this assertion by induction on 6. For 6 = 1, the uniqueness is 
already shown, see Fact 6, and Definition 7 in case 6 = 1. Now suppose that 6 > 
1. Let Divg, and Diviy, be abstract divisor groups for ¥ used for the definition 
of Divg, respectively Div (all v). By the induction hypothesis, Divy, and Diviy, 
are €-adically equivalent. Thus their preimages A, and A’ in Ace are ¢-adically 
equivalent Ug y-lattices. Therefore, by Fact 8, the lattices Ag and Ay (which are 
the preimages of Divg respectively Div in Ag) are £-adically equivalent. Finally, 
use Remark 10 (2), above to conclude. oO 


Definition 12. Let Y be an abstract decomposition graph which is level-6 complete 
curve-like and ample up to level 5. We will say that Y is a divisorial abstract 
decomposition graph if it has abstract divisor groups Divg = Divs as introduced 
above. If this is the case, we will denote by Ag the preimage of Divg in Ag, and 
call it a divisorial U g-lattice in Aw 


3 Abstract decomposition graphs arising from algebraic 
geometry 


3.1 Some general valuation-theoretical nonsense 


Let K be an arbitrary field. The space of all equivalence classes of valuations Valx 
of K is in a canonical way a partially ordered set by v < w iff O, C G@, iff 
m, C my,, and if so, then m, C GO, is a prime ideal of ,, and @,, is the localization 
O7 = (@y»)m,. The unique minimal element of Valx is the trivial valuation vp which 
has O,, = K as valuation ring. Further, the minimal non-trivial elements of Valx are 
exactly the rank-one valuation rings of K (which then correspond to the equivalence 
classes of non-archimedean absolute values of K). Note that if v < w, then @,,/m, is 
a valuation ring in the residue field Kv of v. We denote the corresponding valuation 
of Kw by w/v, and call it the quotient of w by v. Conversely, given v € Valx anda 
valuation w of Kv, the preimage @ of @; under @, — Kv is a valuation ring of a 
valuation w > v such that w/v = w. We define Wov := w, and call it the composition 
of w and v. Valx has in a canonical way the structure of a (half-oriented) graph with 
origin K = Kyo as follows: 


(a) The vertices are the residue fields Kv indexed by v € Valx. 

(b) The set of edges from Kv to Kw is non-empty if and only if v < w and 
rank(w/v) < 1. If so, then w/v is the unique edge from Kv to Kw. We say 
that w/v is a non-trivial oriented edge if rank(w/v) = 1, respectively we call 
w/v a trivial non-oriented edge if v = w, ie., w/v is the trivial valuation 
of Ky. 


We will call the graph defined above the valuation graph for K. There are two 
functorial constructions one should mention here: 
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(1) Embeddings. Leti : L — K bea field embedding and g, : Valk — Val,, 
Vv b> vy ‘= v|z, the canonical restriction map. Then g, is surjective and com- 
patible with the ordering of valuations. And if v < win Valx, then vy, < wy, 
in Valz, and rank(w,/vz) < rank(w/v). Hence if the edge w/v from Kv to Kw 
exists, then the edge w/v, from Lv, to Lwz exists too. Therefore, y, defines 
a canonical projection from the valuation graph of K onto the valuation graph 
of L, under which Kv is mapped to Lv;, and the edge w/v from Kv to Kw (if 
it exists) is mapped to the edge w,/v, from Lv, to Lw,. Note that if w/v is a 
non-trivial oriented edge such that w, = vz, then w/v is mapped to the trivial 
non-oriented edge of Lw, = Lyv_. 

(2) Restrictions. Let Kv be the residue field of v, and let Val, = {w € Valx | w > v} 
be the set of all refinements of v. Then Val, — Valx,, w +> w/v, is a 
canonical bijection which respects the ordering, thus defines an isomorphism 
of the subgraph Val, of the valuation graph for K onto the valuation graph 
Valx, for Kv. 


e The Galois decomposition theoretical side 

Let £ be a fixed prime number as above. For every field K which contains the £°° 
roots of unity, let K’|K be a maximal pro-¢ abelian extension, and we denote by 
ITx = Gal(K’|K) its Galois group. For v € Valx and prolongations v’ of v to K’, we 
have that the inertia/decomposition groups Ty C Z, of the several prolongations 
v’|v are conjugated under J7Tx; hence these groups are equal, as JTx is commutative. 
We will denote them by 7, © Z,, and call them the inertia/decomposition groups 
at v. Recall that 7x, = Z,/T, canonically. 

Via the Galois correspondence and using the functorial properties of Hilbert 
decomposition theory, we attach to Valx a graph (a, which is in bijection with 
Valx and has vertices and edges as follows: The vertices of ai, are indexed by 
the (distinct) pro-£ abelian groups /7Tx,. Concerning edges, if v/w is the unique edge 
from some Kw to some Ky (hence, either w = v and v/w is the trivial valuation on 
Kv = kw, or v < w and rank(w/v) = 1 on Kv), then the unique edge from J7Tx, 
to ITx, is the pair of groups T,/;, © Z\y/, viewed as subgroups of ITx,. Note that 
in case w/v is the trivial valuation, we have merely by definition that 7,,;, = 1 and 
Lw/v = [Txy. 

We will call Aa, the valuation decomposition graph of K, or of ITx. 

Note that the above functorial constructions concerning embeddings and restric- 
tions give rise functorially to corresponding functorial constructions on the Galois 
side as follows: 


(1) Embeddings. Let 1 : L <> K be an embedding of fields, and consider 
a prolongation 1’ : L’ <> K’ of 1. Then 1’ gives rise to a projection 
@, : IIx — IT,, which in turn gives rise canonically to a morphism of valuation 
decomposition graphs, which we denote by ®, again: 


®, : Gary > Gra, - 
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Note that ®, maps the profinite group /7x, at the vertex Kv into the profinite 
group /7;,, at the corresponding vertex Lv,;. And concerning edges, ©®, 
maps Ty/, C Zy/, into the pair Ty), SC Zy,/v, Of the corresponding 
inertia/decomposition subgroups of wi /vz in ITzy,. 

(2) Restrictions. For w € Val,, one has Z, C Z, and 7, C T,,. And under the 
canonical projection Z, — ITx,, every T,, © Z,, is mapped onto Ty, © Zy/) 
in [Tx,, etc. 


3.2 Recovering the geometric decomposition graphs 
from the total decomposition graph 


Let K |k be a function field as introduced in the introduction. We notice that the total 
graph of prime divisors Zi of K|k, as defined in the introduction, is the subgraph 
of Valx whose vertices are the generalized prime divisors of K|k and whose non- 
trivial edges are of the form tw/bv with tv > bv generalized prime divisors. (If so, 
then tv/v is a prime divisor of Kv|k.) We also recall that a subgraph Z of G" 
was called a geometric graph of prime divisors for K|k if for every vertex v of Zr, 
the following hold: First, the trivial edge from Kv to itself is an edge of Z, and 
second, the set of non-trivial edges Dy originating from Kb form a geometric set of 
prime divisors of Kv|k. 

Concerning the Galois theoretical side, the total decomposition graph Goo 
of K|k, or of ITx, is the subgraph of the valuation decomposition graph vai, which 
is defined by the total prime divisors graph A. And a geometric decomposition 
graph for K|k, or for ITx, is any subgraph %, of Goo which corresponds to a 
geometric graph Y of prime divisors. 

In this subsection we give a recipe to recover/describe the geometric decompo- 
sition graphs %, for K|k inside the total decomposition graph Gener of K|k using 
only the Galois theoretical information encoded in Gu. 

We begin by recalling a criterion for the description of the geometric sets of 
prime divisors of a function field K|k, as presented in Pop [24], Subsection 2 D), 
an idea which was used in essence already in Pop [27]. 

Let K|k be a function field over an algebraically closed base field as usual. For 
every normal model X — k of K|k, we denote by Dy the set of all the prime 
divisors of K|k defined by the Weil prime divisors of X. 


Fact 13. For a set D of prime divisors of K\k, the following conditions are 
equivalent: 


(i) Forallnormal models X — k of K\k one has that D and Dx are almost equal. 
(Recall that two sets are almost equal if their symmetric difference is finite.) 

(ii) D is geometric, i.e., there exists a quasi-projective normal model X — k of 
K\k such that D = Dy. 
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Recall that a line on a k-variety X is an integral k-subvariety of X , which is a curve 
of geometric genus equal to 0. We denote by X'"* the union of all the lines on X. 

We will say that a variety X — k is very unruly if the set X'* is not dense in X. 
In particular, a curve X is very unruly iff its geometric genus gy is positive. 
Further recall that being very unruly is a birational notion. In particular, it makes 
sense to say that a function field K|k with td(K|k) = d > Ois very unruly if K|k 
has models X — k which are very unruly. 

Suppose that d > 1. We call a prime divisor v of K|k very unruly if Kv|k is very 
unruly as a function field over k. A prime divisor v of K|k is very unruly iff there 
exist a normal model X — k of K|k and a very unruly prime Weil divisor X; of X 
such that v = vy,. 

The following is a more precise form of Proposition 2.6 from Pop [27], but see 
rather Pop [24], Section 3, for details: 


Proposition 14. With the usual notation, the following hold: 


(1) A set D of prime divisors of K|k is geometric iff there exists a finite ¢- 
elementary subextension Ko|K of K'|K of degree €¢ such that for every 
£-elementary subextension K,|K of K'|K of degree €*4 containing Ko, one 
has that D consists of almost all prime divisors v of K|k whose prolongations 
vi|v to K,|L are very unruly prime divisors of K. 

(2) Let L|K be a finite subextension of K'| K. Then a set Dy of prime divisors of 
L|k is geometric iff there exists a geometric set of prime divisors D of K|k such 
that D, is almost equal to the prolongation of D to L. 


Proof. The proof is more or less identical with the one from Pop [24]; thus we 
refer the reader to that work for the details: Choose some transcendence basis 
(t1,...,tg) of K|k and a “sufficiently general” separable polynomial p(T) € k[T] 


of degree > 3. Form = 1,...,d, consider u, € K’ with a = p(tm). Then 
Ky = K[uy,..., ug] has degree £“ over K, and it does the job; see [24] for details. 
oO 


Using the proposition above, one deduces the following inductive procedure on 
d = td(K|k) for deciding whether a given set D of prime divisors of K|k is 
geometric, respectively whether a finite subextension L|K of K’|K viewed as a 
function field L|k is very unruly. 


Criterion 15. By induction on d, we consider criteria Paon(D) and Be!) (L|K) 
for sets of prime divisors D of K|k to be geometric sets of prime divisors, 
respectively for finite subextensions L| K of K’| K to be very unruly, as follows: 


(1) Cased = 1: 
. PQ m(D): D is almost equal to the set of all prime divisors of K |k. 
BM (L|K): The genus of the complete normal model of L|k satisfies 
Sik > 0. 
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(2) Cased > 1: 


° Poam(D): With Ko|K and K,|K as in Proposition 14, the set D is almost 
equal to the set of all prime divisors v of K|k whose prolongations v;|v to 
K,|K satisfy AS) (Ky) | Ky). 

° De (L|K): There exists a set D of prime divisors of K|k such that 
Poom(D) holds, and for almost all v € D, the prolongations w|v of v to 
LIK satisfy AS. (Lw| Ky). 


Remarks 16. (1) As mentioned in the introduction, if » is a generalized prime 


Q 


(3 


(4 


wa 


wm 


wa 


divisor of K|k, then via the canonical projection pry : Zy — ITx», one can 
recover the total decomposition graph of Kv|k as follows: The generalized 
prime divisors of Kv|k are precisely the valuations of the from w/v with tw 
a generalized prime divisor satisfying » < tv. In turn, these are exactly the 
generalized prime divisors tv such that T, C Ty, or equivalently Zy C Zy. If 
so, then Tiny C Zyp/p are the images of Ty» C Zp; thus the total decomposition 
graph of Kv |k can be recovered from the total decomposition graph of K|k via 
Pro. 

The finite subextensions L|K of K’|K are in bijection with all the open 
subgroups A C JTx. And if » is a generalized prime divisor of K|k, and tv is a 
prolongation of b to L, then under the canonical projection pr, : Zp — ITxKy we 
have that if L| K corresponds to A C ITx, then the finite residual subextension 
Lro| Kv of Kv'| Kv corresponds to the open subgroup A, := pr,(Zp» M A) of 
IT xp. 

Let Y Cc Gono be a connected full subgraph containing the origin [7x of Goo 
and having all maximal oriented branches of length d = td(K|k). (Here “full” 
means that for all vertices /7xy and [Tx of Y one has that if the edge to/» from 
Kv to Kto exists, then this edge endowed with Typ C Zyp/ is contained in Y.) 
In particular, the following hold: 


(a) For for every vertex ITxy of Y, the trivial edge from ITx,y to itself endowed 
with the inertia/decomposition group of the trivial valuation {1} C ITxp 
belongs to Y. 

(b) If 7x, and ITx belong to Y, and tv/v is a prime divisor of Kv, then the 
edge tv/v endowed with Tj C Zp belongs to FY. 


(c) All maximal branches of non-trivial edges have length d := td(K|k). 


Let J C G* be the (connected full) subgraph defined by Y. For every vertex 
Ko of J, or equivalently a vertex ITxy of Y, let Dy be the set of prime divisors 
v of Kv|k which are the non-trivial edges of J originating from Kv. Then by 
the definitions one has the following: 

G is a geometric decomposition graph iff Dy is a geometric set of prime divisors 
of Ko|k for every vertex Kv of Z, and all maximal oriented branches of G have 
length td(K |k). 
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(5) We conclude that recovering/describing the geometric decomposition graphs 
inside Yor is equivalent to recovering/describing the geometric sets of prime 
divisors of the function fields Kv |k for all generalized prime divisors bv. 


We do this by showing that the geometric Criterion 15 can be recovered from, 
respectively interpreted in, the group-theoretical information encoded in Gop. 


Gal-Criterion 17. 

By induction on dy :=  td(Kv|k), we give criteria Gal Fein (D) and 
Gal A\) (L| Ko) for sets of prime divisors D of Kv|k to be geometric sets of prime 
divisors, respectively for finite subextensions L| Kv of (Kv)’| Kv to be very unruly, 
as follows: 

Case dy = 1: 

Then Kv|k is the function field of a complete smooth curve X, — k with function 
field k(Xy) = Ko. And the set of all non-trivial generalized prime divisors equals 
the set of prime divisors of Kv|k, which is Dy,. Let (T,), be the system of all 
divisorial inertia groups in J7x» (which is part of the hypothesis, as [7x» comes 
endowed with the total decomposition graph of Kv|k, hence encodes the set of all 
the 7,, v € Dy,), and let Tx» be the closed subgroup of I7xp generated by all 7,,. 
Then ITxp/Txy = nt a (Xp) is the pro-f abelian fundamental group of Xp. Since 
char(k) 4 €, it follows that a) a i , where gy is the genus of X,. For 
every non-empty set D C Dy, of prime divisors of Kv|k, let Tp be the closed 
subgroup of 7x, generated by T,, v € D. Then ITxy/T>p is a pro-€ abelian free 
group on 2g, +r — 1 generators, where r = | Dy, \DJ. Since D is geometric iff r 
is finite, we get that D is geometric iff 7x /T>p is topologically finitely generated. 
Hence Powm(D) is equivalent to: 


° Gal Pym (D): ITxy/Tp is finitely generated. 


Let L| Kv be a finite subextension of Kv’| Kv corresponding to A C ITxy as above, 
and let wy : xy — ITxy/A = Gal(L|Kv) be the corresponding finite quotient. 
For every prime divisor v of Kv|k and a prolongation w|v of v to L, we have that 
the inertia group of w|v in G = Gal(L|Kv) is precisely 7,, := 2 (T,); hence the 
ramification index of w|v is e., := |z_(7,)|. Further, by the fundamental equality, 
the number 77, of prolongations of v to L can be computed as [L : Kp] = nz y-e7y. 
Thus applying the Hurwitz genus formula, one has 2g, —2 = [L: Kv](2gp —2) + 
», niv(er» _ 1). And since n, ,, = [L : Kv]/e_y, we see that g, > 0 iff either 
Sy > Oor 0,1 —1/ez.,) = 2. Therefore, taking into account that e,, = |x, (T,)|, 
and that g ie 0 iff Tk» = Tk», we get that the geometric criterion BP) (D) is 
equivalent to 


© Gal A? (L|Kv): Either xo # To or otherwise, (1 —1/|21(T.)|) = 2. 


Case dy > 1: 

Recall that the total decomposition graph of Kv|k can be recovered from Goo 
as mentioned above. Hence without loss of generality (and in order to simplify 
notions), we can suppose that Kb = K. In particular, we will denote by v the 
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prime divisors of K|k, and for finite subextensions L|K of K’|K, we denote by 
w|v the prolongations of v to L. Note that since L|K is abelian, thus Galois, all 
the prolongations w|v are conjugated under Gal(L|K), and thus have the same 
ramification indices and residue function fields Lw|k, which are isomorphic over 
Kv|k. 

Now let D be a set of prime divisors v of K|k. Then Parn(D) is equivalent to 


° Gal Fem (D): With Ko|K and K,|K as in Proposition 14, the set D is almost 
equal to the set of all prime divisors v of K\|k whose prolongations v,|v to K,|K 
satisfy Ga A? (Ki | Kv). 

Similarly, let L|K be a finite subextension of K’|K. Then the geometric criterion 

POL |K) is equivalent to the following: 


> GalA (L|K): There is a set D of prime divisors of K|k satisfying Gal Fen(D) 


such that for almost all prime divisors v € D, all the prolongations w\v of v to 

LIK satisfy GAP 4 (Lw| Kv). 

This concludes the proof of the claim that the geometric sets of prime divisors 
of each Kv|k can be recovered from the total decomposition graph of Kv|k, thus 
from that of K|k. 


3.3. Geometric decomposition graphs as abstract 
decomposition graphs 


Let K|k be a function field over an algebraically closed field k with char(k) # €. 
Generalizing the divisor graphs of prime divisors from the introduction, we define a 
level-6 geometric prime divisor graph for K|k as being a (half) oriented graph Fx 
defined as follows: 


(I) The vertices of Ye are distinct function fields K;|k over k. And Yr has an 
origin which is Ko := K. 

(II) For every vertex K;, the trivial valuation vjo of K; is the only edge from K; to 
itself, and we view this edge as a non-oriented one, or a trivial edge. Further, 
the set of all the oriented edges starting at K;« is a geometric set D;» of prime 
divisors v; of K;*. We call these edges non-trivial, and if vy; € D;* is sucha 
non-trivial edge from K;* to K;, then K; = K;*v;. In particular, td(K;|k) = 
td(K;*|k) — 1. 

(I) The trivial valuations are the only cycles of Y, and all the maximal branches 
of non-trivial edges of Z have length equal to 6, hence 6 < td(K|k). 


As indicated above, we attach to Y the corresponding subgraph G% C Waly. 
Hence by definition one has: 
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(I) The vertices of %, are in bijection with the vertices of Zr, via the Galois 
correspondence, i.e., the vertices of %, are the pro-f groups [Tx, with K; 
vertex of Ye. In particular, Tx, := Ix is the origin of %,. 

(Il) The edges of %, are in bijection with the edges of Y. The trivial edge vio 
from K; to itself is endowed with {1} =: Ty) C Zyjo := Ik;, 1.€., with 
{1} C Ix,. Every non-trivial edge v; is endowed with the inertia/decomposi- 
tion groups T,, C Z,,. In particular, if vy; is an edge from Kj to Kj = Kjvj, 
then ITx, = Z,,/T,,. 

(I) The trivial valuations are the only cycles of %,, and all the maximal branches 
originating from J7Tx, and consisting of non-trivial edges of %, have length 6. 


Proposition 18. With the above notation, G, is a level-5 abstract decomposition 
graph. 


Proof. Indeed, all the axioms of an abstract decomposition graph are more or 
less well-known facts concerning Hilbert decomposition theory for valuations. For 
instance, if v; is a prime divisor of K;*|k, then all the prolongations v;, of v; to 
Kj, are conjugated under JTx,,.; therefore, their inertia, respectively decomposition, 
groups are equal, say equal to 7,, C Z,,. Further, 7,, X Ze, and the residue field 
K!..v, equals (K;*v;)', thus (Kj*v;)' = Z,,/T,,, etc. Moreover, for prime divisors 
v; 4% w; of K;*|k one has the following; see e.g., Pop [28], Introduction, and 
especially Proposition 2.5 (2): The decomposition groups Z,, and Z,,, have trivial 
intersection. And finally, if X;« —> k is any quasiprojective normal variety, and 
Dy,, is the set of Weil prime divisors of X;*, then every open subgroup of JTx,, 
contains almost all inertia groups T,,. Indeed, in every finite separable extension of 
Kj» only finitely many Weil prime divisors of X;* are ramified, etc. oO 


Remarks 19. Let %, be a level-é abstract decomposition graph as as above, and to 
simplify notation a little bit, set A R= Ag, _ U K I= U Gog» Divx — Dive, lies 
and Clr i Clay, - We next analyze/describe the abstract objects U Des A Des 
Div Dx and Clay and relate the abstract exact sequence 


OK 
n~ pass a, — n~ 
1- UGH, - AK — DivER > Cl, > 0 


to the geometry of K|k as reflected in the geometric information encoded in the 
prime divisor graph Yr. In order to do so, let us consider some normal model 
X — k of K\k such that D := Dy is the set of 1-edges of the given %. 
Without loss of generality, we can and will suppose that X is quasi-projective. 
Then by Krull’s Hauptidealsatz, Y%p := I'(X, @y)* depends on D only, and not 
on X; #p(K) := K*/ Wp is isomorphic to the group of principal divisors on X. 
Hence since Div(D) := Div(X) depends on D only, and not on X, it follows that 
€l(D) := €l(X) depends on D only, and not on X; one has a canonical exact 
sequence 


div, rr 
0 > %p(K) 3 Div(D) —> €1(D) > 0, 
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and the resulting exact sequence of ¢-adically complete groups: 
— div) —~ ~ 
0 > Teewn) > Hp(K) — Div(D) —> €l(D) > 0, 


where Ty e¢i(p) = iim im ,€l(D), with n = €° and e = 0, is the ¢-adic Tate module 
of €1(D). Since 1 ~ Up [k* > K*/k* > Hy lp (K) — 1 is an exact sequence of 
free abelian groups, so is 1 > ~U p = K = BF (K) —> 1. Hence if Up C K is 
the preimage of Ty ep) > HK) under K —> Xp (K), we finally get an exact 
sequence of the form 0 > Up a Kk = Div(D) > €1(D) — 0, and therefore, 
Us fits canonically into an exact sequence | > YZ p > ie —> Teewpo) > 0. 


(1) By Kummer theory we have an identification: Ax = Hom(/7 K, Ze) = K. 

(2) Concerning/describing U gx: Recall that we defined U i= Hom(/7 LQ» Zi), 
where IT, g, := [x/To, and Ta, is the group generated by all the inertia 
groups 7, with v all the 1-edges of Y. By the definitions, we have Tg, = Tp 
and IT. g, = IT\,p. Further, in the notations from Fact 55, it follows that IT; p 
is the Pontryagin dual of Ago, which fits canonically in the exact sequence 


0- YW ® Qe/Ze > Aw > ¢o f(D) > 0. 


Let Ap C Ago be the maximal divisible subgroup. Since Wp @ Qe/Z, is 
divisible, it follows by Fact 54 that Ao fits into an exact sequence of the form 


0- ® Qe/Ze — Ao > (0 Ap (X) > 0, 


and Ay / Ap finite. Hence /7;,p has finite torsion, UZ = Hom(IT), 9% ; Zz) is 
the Pontryagin dual of Ao, and we get an exact sequence 1 > Uv D— U Dn 
Teeup) — 0. Finally U K = U p, and this gives the precise description of 
U x iM geometric terms. 


(3) Concerning Divg,: For every prime divisor v one has a commutative diagram 


wm 


n 


x cers vK 
(*), { le 


Hom(/7x, Ze) as Hom(7,, Ze) 
The diagrams (*), with v € D give rise canonically to a commutative diagram 


K —Div(D) = @,vK > €l(D) > 0 
1 {Se i 
Hom(I7x, 21); Hom(T,, Ze)“ Pa, > 0 
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where the vertical maps are isomorphisms, and PR is simply the quotient 
of the middle group by the first one. Therefore, the resulting canonical 
identification K > Hom(/7 K> Z) =! Ag, gives rise a canonical isomorphism 
Div(D) > ©, Hom(T,, Ze) =: Diva. 

Finally, the above identifications K > Hom(/7Tx, Ze) and Div(D) > Div 
give rise to an identification €l(D) > Px = Clog. Hence by the structure 
of €((D) := €1(X) given in Fact 54, it follows that Clg, ~ El(D) = A\(X), 
and is thus a finite Zy-module. 


(4 


wa 


Fact 20. With the above notation, let Div’(D) be the preimage in 

Div(D) := Div(X) of the maximal ¢-divisible subgroup El (D) of ED) := El(X). 

Then one has: 

(1) Div’ (D)— Div(D) gives rise to. an embedding Div’ (D)— Div(D). 

(2) Div’ (D) = ker( Div(D) > €l(D)) = = div, (K), and Div'(D) = Div’ (D)N 
Div(D). 

(3) Let D > D be geometric sets with Il, p = IT, 5. Then Div’ (D) C Div'(D) 
has finite bounded index. Finally, for every D large enough, Div'(D) Cc Div(X) 


depends on K|k only, and CUD) = ~ €l(D) Q a, 


Proof. To (1) and (2): We get a commutative diagram of the form 


0 > %p(K) > Div'(D) > €'(D) > 0 


| 1 

0 > %(K) > Div(D) > €l(D) > 0 
a + 
G > © 


where ZY = Div(D)/Div'(D), @ = €l(D)/€l(D), and 9 > @ is an 
isomorphism. By the structure of €((D) = €((X) in Fact 54, @ := €(X)/€l'(X) 
is of the form @ = C; + C, with C, a finite abelian €-group, and C) a finitely 
generated free abelian group. Hence we get embeddings of the -adic completions 
© > @ and J & J. Further, since €l(D)’ is £-divisible, its £-adic completion 
is trivial, hence €(D) + @ isan n_ isomorphism. And finally, the middle (exact) 
column defines an exact sequence Div’ (X) Oo Div(X )—> GQ - 0. The remaining 
assertions follow easily in the same way, by chasing in the commutative diagram 
above. 

To (3): Let D = Dy and D = Dy for some normal models X — k and Y > k 
of K|k. By Fact 55, especially assertion (4), it follows that Vy = Wy, and with 
Y — k as in Fact 55 we have that Aios(Y) C Ao(Y) C A;(Y) are birational 
invariants of K|k, etc. Oo 
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Definition/Remark 21. Consider notation as above. 


(1) A geometric set D := Dy of prime divisors for K |k is called complete regular- 
like, if yp = IT\,x and €l(D) has positive rational rank and for every 


geometric set of prime divisors D > D one has €(D) = €\(D) ® ZIPXP 


¢ Note that if ¥ — k is a complete regular variety, then Dy is complete regular- 
like, but the converse is not true in general. Nevertheless, if X is a curve, then 
Dy is complete regular-like iff X is a complete regular curve. 


(2) Let Z bea level-5 geometric graph of prime divisors for K|k. For each vertex 
v of Yr, let Dy is the set of non-trivial 1-edges of Z with origin Kv. We 
say that Zr is complete regular-like, if Dy is complete regular-like for all 
with td(Ko|k) > 0. 

The complete regular-like prime divisor graphs for K|k are abundant. More- 
over, for every geometric prime divisor graph FZ C G there exist complete 
regular-like decomposition graphs Fx with A. C Dx. Indeed, we proceed by 
induction on the transcendence degree as follows: Let D’ be the set of 1-indices 
of ZH. Let D D D’ be any complete regular-like set of prime divisors of K. 
Then td(Kv|k) < td(K|k) for all v © D, hence for every v € D by induction 
one has the following: There exists complete regular-like prime divisor graphs 
Br, for Kv|k. Moreover, if v € D’, then there exists a complete regular-like 
prime divisor graph Yx, which contains the residual prime divisor graph F, 
of Z,. Then the resulting prime divisor graph Y having D as set of 1-indices 
and Yx, as residual prime divisor graph at each v € D is by definition complete 
regular-like. 


(3 


wm 


Combining the above discussion with the one in the previous subsection, we get: 
Proposition 22. In the above notations and context, the following hold: 


(1) The geometric decomposition graphs Gq, for K|k can be recovered by a group- 
theoretical recipe from the group-theoretical information encoded in Goer. 

(2) Moreover, given a geometric decomposition graph Gg,, the fact that Go, is 
complete regular-like can be recovered from the total decomposition graph Gao 
endowed with Go,. 

(3) In particular, the complete regular-like decomposition graphs Ga, for K|k can 
be recovered from the group-theoretical information encoded in Gar. 


Proof. To (1): This is more or less the Gal-Criterion 17 combined with Remarks 16. 

To (2): Let Ye be a geometric graph of prime divisors for K|k, and %, the 
corresponding geometric decomposition graph for K |k. For every vertex Kv of Z, 
let X, — k be a normal model of Kv|&k such that D, = Dy,. Let Tp, © Mk 
be the closed subgroup generated by all the inertia groups 7,, v € Dy. Further, 
by Remark 16 (1), the total decomposition graph of Kv|k can be recovered from 
Geogr. In particular, the set of the inertia groups 7,, of all the prime divisors w of 
Ko|k, hence the closed subgroup Txy generated by all these inertia groups, can be 
recovered from %o endowed with %,. Therefore, from Yj endowed with %_ 
one can check whether the following hold: 
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(a) {T, | v € Dy} equals the set {7;, | w all the prime divisors of Kv |k}. 

(b) Tk» = Tp,. 

(c) €l(Dy,) is not finite. - 

(d) Cl(D») = El(Dy) a oa for every geometric set Dy > Dy, of prime 
divisors of Ko|k. 


Note that (a) holds iff X, is a complete normal curve. Indeed, if dim(X,) > 1, 
then for every given normal model Y, — k of Ko|k there exist infinitely many 
prime divisors of Kv|k which are not Weil prime divisors of Y,. And if Xy is 
a normal curve over k, then the set of prime divisors of Kv|k is in bijection 
with the set of all the points in the normal completion of X,, etc. Further, the 
conditions (b), (c), (d), are by definition the necessary conditions for D, to be 
complete regular-like. This concludes the proof of assertion (2). 

To (3): This follows immediately by combining assertions (1) and (2) above. oO 


Proposition 23. With the above notation, let Go, be a complete regular-like 
decomposition graph for K|k. Then the following hold: 


(1) Go, viewed as an abstract decomposition graph of level § := td(K|k) is 6 
complete curve like and ample up to level 5. Thus G, is divisorial. 
(2) Let D be the set of \-edges of Gx. Then the following hold: 


(a) Uz is the £-adic dual of IT,x = IT\,p, and thus it depends on K |k only, 
and not on G,. Therefore we denote Ge = O o%.- 

(b) Div(D)@ := Div(D) ® Zvy is an abstract divisor group of Go, which we 
call the canonical abstract divisor group of Go. 

(c) The preimage Ag, of Div(D) ) in K will be called the canonical 
divisorial U 9, -lattice of Gz,, and it has the following description: 


° Let Div’(D) © Div(D) be the preimage of the €-divisible subgroup €( (D) 
of €(D). Then Div'(D)@ = divj(AqH) and div, (AGH)/An(K)y = 
Cl (D)@. 


(3) Up to multiplication by t-adic units, the canonical U x-lattice Ax := Ax, 
depends only on K|k, and not on x. 


Proof. To (1): First, the fact that %, is complete curve-like follows from the fact 
that all the (6 — 1) residual function fields Kv|k have td(Kv|k) = 1, and the facts 
that %, is complete regular-like. In order to show that %, is ample up to level 
6 = td(K|k), we have to show that conditions (i), (ii), from Definition/Remark 3 (7) 
are satisfied. First, condition (i) follows immediately from the weak Approximation 
Lemma. To check condition (ii) is a little bit more technical though. Since %, 
is complete regular-like, for every multi-index v of %,, there exists a complete 
regular-like model X, — k of Kv|k such that Dy, is the set of 1-vertices of Y,, ; 
hence in particular, one has IT, xp = IT\,py,. And we will denote by X¥ > k the 
corresponding model of K|k; thus IT,,x = IT,,p,- 
We carry out induction on d = td(K|k) > 1 as follows. 
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Let AC K be an £-adically closed submodule. Then A gives rise functorially 
to a subextension K,|K of K’|K by setting Ky := U,, Ky, where K, := K[X/Ay, ], 
and A, C K*/n is the image of A in K*/n = K/n for alln = €°. (Note that since 
Ac R is closed, A is the projective limit of the A,,’s inside K .) We notice that 
K,,|K is Z/n elementary abelian with Galois group equal to Hom(A,, jz,,); thus 
Ka|K has Galois group Hom,,(A, Te), where Ty is the Tate module of the jugco 
roots of unity, which we have identified with Z,. 

Further, let v be an arbitrary valuation of K, and vy a prolongation to Ky. Then 
under the above correspondence one has the following: The decomposition field of 
va|v is K,,, where Ap := AN ker(j,), and the inertia field of valv is K4,, where 
A, := AN Cx In particular, v is unramified in K,|K iff A C c. 

Checking condition (ii) (a) from Definition/Remark 3 (7): The main technical 
tool for the proof is Theorem B from Pop [29], which implies the following: Since 
A:= Ux, is the £-adic dual of IT,,x, it follows by the definition of IT,,x that the 
corresponding subextension K,4|K is the maximal subextension of K’|K in which 
all prime divisors v of K|k are unramified. But then by [29] it follows that all the 
k-valuations of K|k are unramified in K4|K. And correspondingly, the same is true 
for all the residue function fields Kv|k of Zr. Now for v a fixed prime divisor of 
K|k, let V be the set of the k-valuations v = wovof K|k, with w € Dy, the set 
of prime divisors defined by the complete regular-like model X,, > k mentioned at 
the beginning of the proof. Since U, Cc U, and ply) = Ce forb = wov, it 
follows that setting A, := Npyey Uys we have 


Ay ce and 7,(Ay) = I. Us) EMp w(Oy) = Ow 0, a U2: 


On the other hand, by the discussion above, all the k-valuations of K|k are 
unramified in K,|K. Hence in particular so are all the » € ¥Y; thus A C U b 
for all » € Y. We conclude that A C A,. Therefore, 7, maps U De =: A into 
W (Ay) © Uae, as claimed. 

For the second assertion of condition (ii) (a) from Definition/Remark 3 (7), 
let K fie K be the union of all the finite-corank submodules | of K, and define 
Kviin Cc Kv correspondingly. We then have to show that U De Iv (K ant j)y= Rviin- 
Let AC K fin be a finite-corank Z;-submodule. Since A has finite corank, there exits 
a cofinite subset D’ C Dy such that v'(A) = 0 for every v’ € D. Therefore, if x, 
is the center of v on X, for every sufficiently small open neighborhood X’ Cc X, 
we have v’(A) = 0 for all v € Dy’, v # v. In particular, since X is normal, thus 
smooth at x,, we can choose X’ C_X to be smooth such that x, € X’ and w(A) = 0 
for allw € Dy, w # v. Since AN U, is contained in Up, it is sufficient to 
show that (OU D a is contained in Kin, hence mutatis mutandis, we can suppose 
that A := Up, If so, K,4|K is the maximal subextension of K’|K in which all 
v’ € Dy: are unramified. Let ¥ > k bea projective normal completion of X (note 
that ¥ — k exists, because X’ C X, and X is normal quasi-projective), and set 
S:=X \X’; hence S is a proper closed subset of X which does not contain x,. Let 
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further X,, C X be the closure of Xy in =. and set S:= SN Xx,, thus S C X,, is 
a proper closed subset. Further, we view X,, — & as a projective, thus proper (not 
necessarily normal) model of Kv|k. For every k-valuation w of Kv|k we claim the 
following: 


Claim. Suppose that w(j,(A)) 4 0. Then the center x,, of w on X,, lies in S. 


Indeed, since w( j(A)) # 0, it follows that setting » = wo v as a valuation 
of K|k, we have v(A) # 0. Therefore, by the introductory discussion above, it 
follows that » is ramified in Ky|K. We claim that the center x» of v on X lies 
in S. By contradiction, let x» € X’. Since X’ is smooth, hence regular, by the 
purity of the branch locus, one has I7;,p,, = IT,(X). Hence every finite cover 
Y'’ — X’ defined by some open subgroup of J7\,y, is étale. Hence the cover 
Y' xy Spec @, — Spec Oy is étale, thus unramified. Therefore, v is unramified 
in K,|K, contradiction! Since O, C @,, by the valuative criterion for properness, 
we have OY x, Cc OF x,» and x, lies in the closure of {x,} in a hence in X,,. 
In particular, since » has no center on X’, it follows that x» € S = Xx, < 
Finally, using the valuative criterion for properness again, it follows that viewing 
Xx, as a projective (not necessarily normal) model of Kv|k, the center of w on Xx, 
is precisely x,. This concludes the proof of the Claim. 

Using the claim above, we finish the proof of (11) (a) from Definition/Remark 3 (7) 
as follows: For every geometric set of prime divisors D, of Kv|k, only finitely many 
w € D, have center in S'. Hence by the claim, only finitely many w € D, satisfy 
w( j(A)) # 0. From this we conclude that 7,(A) has finite corank. 


Checking (ii) (b) from Definition/Remark 3 (7): Let A C K fin be a finite corank 
Ze-module. Further let X’ C X be a smooth open subvariety such that A C U Dyiz 
As in the proof of (ii) (a), mutatis mutandis, it sufficient to check (ii) (b) for the 
Z -submodule of finite-corank A := Nyep,, U yi 

In order to do so, let X’ C X be a normal projective completion of X’, and 
‘a= PY a projective embedding. If H is a general hyperplane, and 
Z := XH and Z' := X'N H are the corresponding hyperplane sections, 
it follows that Z’ <> X’ is a prime Weil divisor such that Z’ — k is smooth, 
because X’ —> k was so. Further, Z’ <> X’ gives rise to a surjective group 
homomorphism J7,(Z’) — IT,(X'), which is an isomorphism if dim(X’) > 2. 
Hence since X’ and Z’ are smooth, thus regular, by the purity of the branch locus we 
have IT;.p,, = I1\(X’) and IT,,p,, = I\(Z’); thus we get a surjective projection 
IN\.p,, — IN.py,. Let v := vz, be the prime divisor of K|k defined by the 
Weil prime divisor Z’ of X’. Then taking ¢-adic duals, it follows as in the proof 
of (ii) (a) above that the surjectivity of the projection JT,,p,, — IT\,p,, implies that 
Ine i > Us is injective, as claimed. 

To (3): It follows immediately from (the proof of) assertion (1) above, together 
with Remarks 19 (2), (3), and (4) and Fact 20 and Fact 55 (4). oO 
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4 Morphisms and rational quotients of abstract 
decomposition graphs 


4.1 Morphisms 


Let Y and # be given abstract decomposition graphs of levels dg and 6, based on 
G = Gp, respectively H = Ho. We denote as usual by T, C Z, and G, = Z,/T, 
the 1-edges, respectively the 1-vertices, of Y, and correspondingly by 7, C Z,, and 
Gy = Z,/T, those for #. Further, Y, and “%, are the corresponding 1-residual 
abstract decomposition graphs, which have then level dg — 1, respectively dy — 1. 
We also recall that vo and wo are the trivial valuations of G, respectively H, and that 
their inertia groups are trivial by definition. 


Definition/Remark 24. In the above context we define: 


(1) Let ® : Go — Hp be a (continuous) group homomorphism. Let v and tv be 
multi-indices for Y and #. We define inductively on the length of v the fact 
that t~ corresponds to » via ® as follows; see Definition/Remark 2, especially 
points (3) and (4), to recall notation: 


(i) The trivial multi-index t» = wo corresponds to v if and only if (7,) = 1 
and ®(Z,) is open in Ho. And the only to which corresponds to the trivial 
multi-index tv = vo is the trivial multi-index tv = wo. 

(ii) Suppose that tt = (wy,...,w,) and » = (y,;,...,v,) are both non- 
trivial, and let us set » = (b;,v,) and to = (tv), w ,) with v; and tv, the 
corresponding multi-indices for the residual abstract decomposition graphs 
G,,, respectively %,,. (Note that v; and/or tv; might be trivial.) Then we say 
that to corresponds to v if and only if one of the following hold: 


(a) If &(7,,) = 1, then under ®,, : G,, = Z,,/T,, > Ho, inductively one 
has that to corresponds to v1. 

(b) If O(7,,) A 1, then O(7,,) C T,, and O(Z,,) C Ty, are open 
subgroups, and under ®,, : G,, = Z,,/Ti, > Zw,/Tw, = Aw, 
inductively one has that tv; corresponds to vy). 


(2) Let tw correspond to some v. Then for every tv’ < 1, there exists v’ < » such 
that tv’ corresponds to v’. The proof of this assertion follows easily by induction 
on the length of v, and we will omit it. 

Finally, let %g and Vv be the sets of the multi-indices v of Y, respectively tv 
of #, and let Wg. C Vz be the set of all v € %g such that there exists some 
tw, € Vy which corresponds to v. Then the correspondence defined at (1) 
above gives rise to a map Go : Vg,o > Vee, 0t> go(b) = tv := Wy. 

If go(v) = tv, we say that ® maps v to tv, or that w is the image of 
under @. 
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Definition 25. With the above notation, let 6 < dg, 6. be a non-negative integer. 
We define a level-5 morphism ® : Y — # inductively on 6 and dg as follows: 


(1) A level-zero morphism ® :G > is any group homomorphism @ : G > H 
under which wo corresponds to vp. Equivalently, ® is open. 

(2) A level-5 morphism ® : GY > # is any level-zero morphism  : G > H 
which inductively on 6g and on 6 > 0 satisfies the following: 


(i) Almost all 1-vertices of # correspond to some 1-vertices of Y, and every 
1-vertex of # corresponds to only finitely many (maybe to none) of the 
1-vertices of Y. 

(ii) If the trivial valuation wo corresponds to a 1-edge v, then dg, = dy—1 > 4, 
and the canonical group homomorphism ®, : G, = Z,/T, > Ho defines 
a level-6 morphism of the corresponding residual abstract decomposition 
graphs Y, and #. 

(iii) If wis a l-edge corresponding to the 1-edge v, then the group homomor- 
phism ®, : G, = Z,/T, > Z,/Ty = Hy defines a level-(5— 1) morphism 
of the corresponding residual abstract decomposition graphs Y, and %,. 


Remarks 26. In the above context, let d : Y > # bea level-6 morphism of 
abstract decomposition graphs. 


(1) The morphism @ gives rise to a Kummer homomorphism 
Aw i= Hom(H, Zz) eile Hom(G, Z,) =: Ag, gRogo®. 


Since ® has an open image, and Ag and A y are torsion-free, ¢ is injective. 


From now on suppose that 6 > 0, and that v and tv are the multi-indices of Y, 
respectively #, which correspond to each other. Let 6, and 6,, be their lengths, and 
suppose that dy <6. 


(2) By : % => Heo has level (5 — 6,) and the resulting residual Kummer 
homomorphism ae A He, > Ag, is injective by the remark above applied 
to Dy. 

(3) To simplify notation, let us set Az, = Hom(Z,, Ze) and Ar, = Hom(T», Zz), 
thus in particular, Ar, = ZS The inclusions T, <— Z, © G and the 
canonical exact sequence 1 > T, > Zy — Gy — | give rise in the same 
way as at Definition/Remark 3, points (3) and (6), to morphisms of ¢-adically 
complete Ze-modules as follows: 


~ resz ~ resp ~ inf ~ resp 
jy’: Ag —> Az, —> Ar, and 0— Ay, —> Az, — Ar, > 0. 
: . al Pe 
In particular, setting U,, := ker(resz) and Uy = ker(j”), we get exact 
sequences 


v 


. re oe Al af o> 
0 > Uy — Ag — Ar, > 0 and 0-0, 0 0, Ay, 0. 
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The surjective morphism Jy : Us > Ag, is called the canonical v-reduction 
homomorphism. 


(4) By induction on 6, and 6), one gets the following: (Z,) C Zw and O(Z,y) 
is open in Z,,, and ®(7,) C Ty and ®(7,) is open in T,,. Hence since @ is 
open and restricts to an open homomorphism Zy — Zy and Ty > Typ, by 
taking €-adic duals we get commutative diagrams with injective columns and 
exact rows as follows: i 

Uw —> Az, — Ar, Ty > Um > Ase, 

Ls Ls Js and eds Joo 

a a mm Al a be 

Oy Ae, = Ae UC, SU, > Ae 
(5) A special case of the above discussion is that ) = v and tv = w are |-vertices. 


If t, and t, are inertia generators at v, respectively w, there exists a unique 
Avy € Ze such that @(t,) = to. And we have commutative diagrams dual to 


each other: 
T; — G A Fé Zz Ze Pw 
le le and \3 la vw 
Ty — H Ag ay Ze Py 


where @,, and @, are as in Construction 5. Further, the horizontal maps in the 
first diagram are the inclusions, and the last vertical map in the second diagram 
denotes the Ze-morphism defined by gy > dyyw@y. 


Definition/Remark 27. Let 2 : @ —> # be a level-6 morphism of abstract 
decomposition graphs. We will say that: 


(1) @ is proper if first each to corresponds to some bv and every v has an image tv, 
and second, inductively on dg, for every 1-edge v of Y and the corresponding 
edge w of # (which could be the trivial edge), the residual morphism 
®@, : GY, > GY, is a proper one. 

(2) @ defines # as a level-5 quotient of Y, or that # is a level-5 quotient of Y 
via ®, if ® is proper, and we have ®@(G) = H. 

(3) We notice that a level 6 morphism ® : Y — # is a proper morphism iff in the 
notations from Definition/Remark 24 (3), for every tv and vb which correspond 
to each other one has %, 6, = ¥%g, and the residual map go, : 4g, > Vx, 
is onto. 

(4) If 6: Y > # is a proper morphism, and tv and v correspond to each other, 
then the corresponding residual morphism ®, : Y — Gp is proper. 


Remarks 28. Let 2: > # bea level 6 > 0 proper morphism. 


(1) Let » be a multi-index of Y, and let T, = Ze be the inertia group at b, as 
defined in Remark/Definition 2 (3). Then ®(7T,) is a free Ze-module of rank 
6’ < dy. Suppose that 6’ < 6. Then using the properness of ®, one checks by 
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induction on 6, that there exists a unique multi-index tw such that the following 
hold: ®(Z,4) C Zy and ®(T,) C Ty, are open subgroups. Thus in particular, 
to corresponds to v. 

(2) Denote by Ty the subgroup of G generated by all the inertia elements 
of G, and define 7,4 C HA correspondingly. Then in the notation from 
Definition/Remark 3 (2), ® gives rise to a commutative diagram as follows: 


l1> Tg —-> G—> Ig -1 


je fe 


l1> Tye —> H—Ihewe 1. 


Next suppose that Y and # are divisorial, and let Ty = (t,), and Tg = (tw)w 
be distinguished systems of generators for Y, respectively #, which give rise to 
abstract divisor groups Divs, and Divz,, for Y, respectively #, and abstract 
divisorial lattices Ag and Ay. 


(3) For every w, denote by X,, the set of all the v to which w corresponds. Then 
X,, 1s finite non-empty (by the fact that ® is proper). For every w andv € X,,, 
there exists a unique a,,, € Ze such that ®(t,) = 14". Equivalently, denoting 
Bg = (P,)y and B.v = (Py) the dual bases to Tg = (1,), and Ly = (Ty) 
as defined/introduced at Construction 5, by Remark 26 (4) above via db we have 


Pw > vex, GvwPr> 
and therefore ¢ gives rise to a morphism 
dive : Dive — Divg 


which maps Divr,, ® Ze into Divy, ® Z, and fits into the following 
commutative diagram: 


n~ a TO n~ 
3 Use Ae Diy pL S0 
(*) \3 | : (sive {ean 
0—> Ug sly Dively —>0 
(4) Recall that for every divisorial 0 g-lattice Ax., in Ag one has the following: 
Agin = Azz ® Ze, and therefore Aggy is exactly the preimage of Divs, ® Ze 
under 7%. Hence from the commutative diagram (*) above it follows that 


wa 


(xt) Ayin = $7! agin). 6Axein)NUg = OC x), d(A xin) = (A) NA gin. 


In particular, d maps finite-corank submodules into such, and preimages of 
finite-corank submodules under ¢ are again such. 
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(5) With the above notation, the following are equivalent: 


(a) There exist fy = (t,)), Lx = (Ty)w Such that a,, € Ze) for all 
w,ve X,. 

(b) dive (Divs ,,) C Divey. 

(c) (Ax) € Ag. 


And if the above equivalent conditions are satisfied, one has equalities as 
follows: 


(xx) Dive y. = divg! (Divey), Ary =¢ '(Asy), GAs) = dA) N Agy- 


Thus in particular, Az ,, can be recovered from Ag,, via @. 


Proof of (5): The implication (a) = (b) follows immediately from the definition of 
div, and the implication (b) => (c) follows from the definition of A. and Ag. In 
order to prove (c) => (a), let v € X,, be given. Then combining Remark 10 (1), with 
the second diagram from Remark 26 (4), we get a commutative diagram of the form 


3% 
Ag yp _ Z0) Pw 


| , {aw 
Agy a Zo Pv; 

hence it follows that a,,, € Ze), as claimed. Finally, let us show that in case 
the equivalent conditions (a), (b), (c), are satisfied, the equalities (**) hold. First 
observe that since db and div are injective, all the above equalities are equivalent. 
Thus it is enough to prove one of them, say the first one: Recall that by point (3) 
above, Divs ,, ®Zy and Divs,, ®Z, are free Zp-modules on the bases By = (Qy)w, 
respectively By = (@,)y, and that dive maps the former Z-module into the latter 
one. Hence divs! (Divs.,) C Divs ,, ® Ze. Now let x = Yo by @y with by € Ze be 
an element of divg' (Divs, ). Then divo(x) = D>, Dyex, PwdywG lies in Dive, ; 
hence b,,a,,, € Ze) for all w and v € X,,. On the other hand, since ay, € Zyy, 
it follows that b,, are rational numbers. Since they lie in Z, as well, it follows that 
by € Ze). But then we finally get that x = )°,, by @y lies in Az, as claimed. 


Definition/Remark 29. Let  : Y¥ > # be a level-d proper morphism of 
divisorial abstract decomposition graphs with 6 > 0. 


(1) We say that @ is divisorial, if all residual morphisms ®, : 4%, > Mp, with tw 
of length < 6 satisfy the equivalent conditions (a), (b), (c) from (5) above. 

(2) If@: Y > # is divisorial, the residual Kummer morphism bo maps divisorial 
lattices for Ap into divisorial lattices for Y,. And the commutative diagram (+) 
from Remarks 28 above gives rise to a commutative sub-diagram: 


2 a 
00> Ooh nye Sly >0 
(* *) \3 \3 “i (sive {can 


6=3 0 ps4 ding Stig 0 
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(3) It is not too difficult to give examples of proper morphisms ® of divisorial 
abstract decomposition graphs such that ® are not divisorial. Indeed, one can 
give such examples even in the case that both Y and # are complete curve 
like, and @ is a proper morphism of level 6 = 1. The next proposition shows 
that actually the case 6 = 1 is the “generic” source for proper non-divisorial 
morphisms. 


Proposition 30. Let : Y > # be a level-5 proper morphism of divisorial 
abstract decomposition graphs, where 5 = de > 0. Then the following hold: 


(1) Let v, 1 be all pairs of multi-indices of Y, respectively of FH, such that vo 
has length 6 — 1 and corresponds to ». Suppose that for all such pairs », to 
the residual morphism ®, : Gy — Hp is divisorial. Then ® : G > # is 
divisorial. 

(2) If ® is an isomorphism, then b is an isomorphism too, and © is a divisorial 
morphism of abstract decomposition graphs. Hence for any abstract divisor 
groups Divg and Div. of GY, respectively #, there exists ¢ € Zy such that 
the diagram below is commutative: 


ue a 
0 pA opine ei SO 
(* > >) led led [edivo [ercan 
0 Ug Ag = Dive Sle = 0 


Proof. To (1): One carries out induction on dg. 

Case (1) dg = 1. Then 1 = d¢ > 5 = dx > 0; hence all these numbers 
equal |, and the assertion follows from/by the definitions and the hypothesis of the 
proposition. 

Case (2) 6 > 1 arbitrary. Let v be some 1-index of Y, and w the image of v 
under ®. Note that w is either the trivial valuation wo, or otherwise w is a l-index 
of #. We show that the resulting residual morphism ®, : Y > , satisfies 
the hypothesis of the proposition: First , : 4% — #%, is a proper morphism, 
as @: Y > # was so by hypothesis. Second, let t,, be a multi-index of %, of 
length 6. —1, and ,, a multi-index of Y, such that tv, corresponds to v,, under @,,. 


Claim 1. ®y,, : Go,, > Hew, is divisorial. 


Indeed, let us first suppose that w = wo is the trivial valuation. Then “4, = #, and 
to := tv, is a multi-index of # of length 6 — 1. Further, (v,,, v) is a multi-index 
of Y which corresponds to tv. And we have that %,, = %»,,v), Hp», = Hp, and 
®,, : G,, > Ap, is actually the same as Diy») : Gv,.v) > Ap. But then the 
claim follows from the hypothesis of the Proposition. Next suppose that w # wo is 
a l-index of #. Then ®, has level 6 — 1 = 6.2 — 1 = 5. Moreover, if tv, is a 
multi-index of .%, of length 5, — 1, then (tv,,, w) is a multi-index of # of length 


(6.2, _ 1) t)]= 5%, = 6 —1. 
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And since tv,, corresponds to b,,, and w to v, it follows that (tv,,, w) corresponds to 
(v,,, v). But then by the hypothesis of the proposition, the residual morphism 


Pow,,.) : Gv,,.v) = HK v,,,w) 


is divisorial. On the other hand, by definitions we have identifications %p,, 1) = %p,, 
and H(m,,.w) = Hp, and Pip, ») = Ppy,,. This completes the proof of the Claim 1. 

Coming back to the proof of assertion | of the proposition, let Avg and Ag 
be divisorial lattices in Ax, respectively Ag. For [ C Ag of finite corank and 
satisfying 7 Oy = (0), set A := bl). 


Claim 2. db: A He Ag maps I isomorphically onto its image A, and further 
one has that An U g = (0), and A is a finite-corank Zy-submodule of Ay. 


Indeed, by diagram (*) from Remark 28 (3), and in the notation | from there, we 
have that ;” is injective on I, since [9 U = (0). Si Since dive is injective, 
it finally follows that 7” (I) is mapped injectively into Divy. Therefore, b maps 
I’ injectively into Ay, and A := = $(L) ) has trivial intersection with U g. Now let 
us check that A has finite corank in Ag: First let v be a 1- -edge of Y such that 
J’) # (0). Equivalently, A = or ) has a non-trivial image under 


sod : Rp he Tis 


Hence 7” : $(A) is non-trivial. Since the above sequence is f-adically dual to 
Ty, ~G Ze H, it follows that &(T,) ~ 1 in H. Since @ is proper by hypothesis, 
it follows that there exists w such that (7,) C T,,, and ®(T,) is open in 7,,. Hence 
finally w corresponds to v. Therefore, if 7"(A) # (0), then there exists some w 4 wo 
corresponding to v. 

Next, by the commutativity of the second diagram in Remark 26 (4), it follows 
that 7"(A) 4 (0) if and only if 7”(I") 4 (0). Now since I" has finite corank, there 
exist only finitely many valuations w of H such that 7"(") # (0). Finally, for each 
such w there exist only finitely many v’s such that w corresponds to one of the v’s. 
Thus finally there are only finitely many valuations v of G such that 7’(A) 4 (0). 
This completes the proof of Claim 2. 

Now suppose that J” is non-trivial. Then we have the following situation: I” 
and its isomorphic image A are non-trivial finite-corank submodules of Ax, 
respectively Ag. Since AN Ug = (0), it follows that AN Ag is a lattice in A which 
completely determines the divisorial lattice Ag in the f-adic equivalence class of all 
the divisorial lattices of Y. Correspondingly, the same is true for 7M U ze and Ay, 
etc. On the other hand, since A has finite corank, by the ampleness of Y, there exist 
valuations v of G such that the following are satisfied: 


Gj) AC U,, and Jv maps A injectively into Ag,, and for such v set A, := (A). 
Gj) Ay NUg, = (0), because AN Ug = (0) by the discussion above. 
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For such a valuation v, the lattice A M Ag is mapped by j, isomorphically onto 
a lattice in A,. Hence by the properties (i), (11), from Fact 8, we get that there exists 
a unique divisorial OU g,-lattice Ag, of Y, such that 7,(A MN Ag) = A, NM Ag,. For v 
as above we analyze the following cases: 
Case (a): ®(7,) = 1. Then the trivial valuation wo corresponds to v, and for the 
residual morphism @, : Y, > # we have that ®, y is divisorial by Claim 1. Hence 
by Remark 28 (5), there exists | a unique divisorial 0 ye-lattice Av of # such that 
the Kummer homomorphism dy: Aso Ag, maps A yw into Ag,. 
Case (b): ®(7,) # 1. Then there is a non-trivial valuation w corresponding to 
v, and for the corresponding residual morphism ®, : Y% > “%, we have that ®, 
is divisorial by Claim 1. Hence by Remark 28 (5), there exists a unique divisorial 
U w,-lattice Av, of Z, such that the Kummer homomorphism dy : Ax, > Ag, 
maps Aw, into Ag,. Moreover, since d maps J” isomorphically onto its image A, 
and 7, maps A isomorphically onto its image A,, we get that since 7,0 db and j,, 0 dy 
coincide on I’, it follows that 7,, maps I” isomorphically onto its image Jy,, and 
that by maps I\, isomorphically onto A,. Therefore, w satisfies mutatis mutandis 
the conditions (j), (jj), above with respect to I”. Hence I, 1 Ay, is a lattice in I,. 
Hence there exists a unique divisorial U we-\attice A wz of 7 such that 1M Ay is 
mapped isomorphically onto I, 1 Ay. 


Claim 3. In both cases above, b maps Aw into Ag. 


First, with the notation from above, it is clear by the discussion above that d maps 
I'NA x isomorphically onto AN Ag. Now let I’ be a finite-corank Z;-module such 
that INO yg = land CI’. Let A'y, be the divisorial U y-lattice given by the 
construction above when starting with J”’ instead of I”. Then we have that "1 A’, 
is a lattice in I, which is €-adically equivalent to 1 N A... Hence ar N A.) and 
ar M Ay.) are €-adically equivalent lattices in A = or ), and both of them are 
contained in Ay. Hence al NAg) = al N Aly), thus PN Ag =O Aly. 
Hence finally A, and A‘,, are equal. In other word. for every finite-corank Ze- 
module I’ of # as above we have that if [ C I’, then I’M Av is mapped into 
Ag. But then Ay = Ux + Ur/(I’ NM Az) is mapped into Aw, as claimed. 

To (2): Since @ is an isomorphism, it follows that dg = 6 = dy, hence @ gives 
rise to a bijection of the multi-indices v and tv of length 6 — 1 of Y, respectively of 
AC; and if » and tw are such indices, then the residual morphism ®, :Y%, > Hp is 
by definition an isomorphism of compete curve-like abstract decomposition graphs. 
Thus by assertion 1, it is sufficient to prove that all ®, : % — Mp» as above are 
divisorial. Let (o,), be a distinguished system of inertia generators for Gy, where 
the v are the 1-edges of Y,. If w is the 1-edge of 4, corresponding to v, then setting 
Ty i= ®,y(0,), the system (t,),, is a distinguished system of inertia generators of 
Ay. In particular, condition (a) from Remark/Definition 28 (5), is satisfied. Thus 
@, is divisorial by definition, etc. Oo 
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4.2 Rational quotients and geometric like abstract decomposition 
graphs 


We begin by first defining rational quotients of divisorial abstract decomposition 
graphs. The point is that (divisorial) abstract decomposition graphs that arise from 
geometry have “sufficiently many” rational quotients; and morphisms of (divisorial) 
abstract decomposition graphs arising from geometry are compatible with the 
rational quotients. This suggests that for applications, one should consider/study 
divisorial abstract decomposition graphs endowed with “sufficiently many” rational 
quotients, and morphisms of such enriched structures. 

To begin with, let % be a level-one complete curve-like abstract decomposition 
graph. Recall the notation from Construction 5, Case 6 = 1: For every distinguished 
system of generators Ty = (t,), of GY, we have an exact sequence 


G 
* can 


0— Cy, o> Ag, 2+ Dive, — lz, = Za > 0. 


Definition/Remark 31. With the notation from above we define: 


(1) A level-one divisorial abstract decomposition graph G, is called rational if 
Ug, = (0) for some (thus every) distinguished system of inertia generators 
Ty of Y,, as introduced in Construction 5, Case 6 = 1. 


We notice the following: Since Ue, = (0), every U g,-lattice in GY, is actually a 
lattice in Ay,. Let {4 = (1,), be a distinguished system of inertia generators, and 
Ba = (Y,), the corresponding Z,¢)-basis of Divz,. An element of the form 


x= hy -% 


is called a generating element of Ag,. We set (x)o := v’ and (X)oo := v, and call 
these the zero, respectively the pole, of x. Further, we define 


PY, ={x € Ag, | x generating, and (x) =v} = {gy —g,| all’ 4 Vv}, 


and call it a generating set at v for Az,. Clearly, A, defines a Z(g)-basis of Az, 
for every v. And if 2’, = Y is another distinguished system of inertia generators, 
and Y| is correspondingly defined, then ¢ € Z; is the unique ¢-adic unit such that 
e- F = F#,. 


(2) Let Y be a level-6 divisorial abstract decomposition graph, where 6 > 0. 
We will consider quotients ®y : ¥ > Y% of Y together with their Kummer 
homomorphisms ba : Ag, > Ay and in order to simplify notation we set 


a i= bu(Ay,) © Ag. 


Further, for every multi-index v of GY, let Jy : Us > Ae be the canonical 
reduction homomorphism; see Remark 26 for definitions. 
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With the above notation, we say that , : Y > Y, is a rational quotient of Y, if G, 
is rational, and @, is divisorial and satisfies the following: 


(i) For all multi-indices v the following hold: If Jy is non-trivial on Ao N io 
then. Ay C Uy, and Jy is injective on Ag, or equivalently, jp od, is injective 
on Ay,. 

(ii) For every finite Ze-module A C Agtin with U @ CA, there exist 1-edges v 
such that y’(Aa) # O and ker(A a Ag,) =ANAg. 


Fact 32. Let By : Y > GY, be a rational quotient. Then Uy al Bes = 0, and one 
has: 


(1) Let Ag bea divisorial U ¢-lattice in A. Then there exists a unique divisorial 
lattice Ag, in Ag, such that oy(Ag, ) is contained in Ag. Moreover, the images 
Ay := by(Ag,) can be recovered from Ay = ¢y(Ag,) and Ag as follows: 


(x) Aw := ba(Ag,) = Aa N Ag. 


(2) One can recover Ag from Ag using the maps j” and J, as follows: 
(**)Ag = {x € Ag |For all v with y"(Ag) # Oand ’(x) =0, one has j,(x) = 0}. 


Proof. First, since GY, is rational, by definition we have Og, = 0. But then by 
Remark 28 (3), one has 0 = o(U4,) = Ue a ve as claimed. 

To (1): Since ®, defines Y, as a rational quotient of Y, it is divisorial (by definition), 
and Ue, = 0. Hence we can conclude by applying Remark 28 (5). 

To (2): Clearly, if x € Ag, then it satisfies the hypothesis from (*+), i.e., for all v 
with y’(Ae) 4 Oand 7"(x) = 0 one has j,(x) = 0. For the converse, let x € Ay 
satisfy hypothesis (**), i.e., be such that for all v with 7 "(Ag ) # Oand 7"(x) = 0 
one has j,(x) = 0. Since U gM tee = 0, by condition (ii) in the definition of ®g, it 
follows that there exist v such that 7 "(Ag )#0 and /, 1s injective on U g. Therefore, 
by the hypothesis (**), it follows that x e. U g. By contradiction, suppose that 
x ¢ Ay. Let A = Og. + Zex. Since x € Agi, we have A C Agin, and since 
xd Ua. the inclusion U g C Ais Strict. 

Case (a). AN Ay = (0). 

Then by property (ii) of ®, it follows that there exists v such that 7"(Ag) # 0, 
and A C U,, and Jy is injective on A. In particular, x € U,, and Jv(x) is non-trivial. 
Contradiction! 

Case (b). AN Ag F (0). 

Then there exist u € Ue, be Le and z € Ao, Zz # 0, such that u + bx = z. In 
particular, since z€ AC Ag C Aewin: we have z € Agin- Further, b 4 0, because 
Uy nN Ay = 0. Setting Ap := Us. we have Ag C A, and Ap N Ae = 0. Hence 
there exists v such that y’(Ae) #0,and AC U,, and J, is injective on Ap = Ue, 
Thus we have 

Jv(—u) = Jo(Z— u) = bj, (x), 
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hence j,(u) # 0 iff y,(x) # 0, because b H 0. First, if u A 0, then j,(u) F 0, 
hence j,(x) 4 0. Since 7"( ”(Ag) # 0, this contradicts the hypothesis (*+*). Second, 
ifu = 0, thenbx = ze Ag: Since Ag | Ae is torsion-free, it follows that x € Des 
as claimed. Oo 


Definition 33. Let Y be a divisorial abstract decomposition graph, and let Ay C 
Ag be a fixed divisorial 0 g-lattice. Let 29 = {Dy ty be the set of rational quotients 
of Y. For every subset 21 C 29, we define 


Aq = Dae A 


as the Z¢)-submodule of Ag C Ag generated by all the Ay with ®, € 2. 
(1) We say that 2lis an ample set of rational quotients of Y, if the following hold: 


(i) For all a, a’ one has that if ©, # Py’, then Ay A Ag = = (0). 
Gi) Ag A Ug= = (0) and Ay is €-adically dense in Ag. 


e 


Ym 


Suppose that 2l is an ample set of rational quotients of Y. We will say that Y is 
geometric like with respect to X if for every a, a’ there exists a multi-index bv of 
¢ such that: 

(Gj) Ao and Aw are contained in Us: a ” 

Gj) Jv maps Ag and Ay injectively into Ag,, and Jy(Aw) = Jy (Aw). 


(3 


wm 


In the above context, we will call Ag an 2l-arithmetical lattice. Its ¢-adic 
equivalence class depends in general on 2, and not only on equivalence class of 
Ag. Further, _ 

Ug + Ay © Ag 


isaU g-lattice in Ay, and therefore Ay if (U @g + Ag) is a torsion free divisible 
group, hence a Q-vector space. But in general, Ug + Ay is not necessarily a 
divisorial U g-lattice. 


Definition/Remark 34. Let Y and # be geometric-like abstract decomposition 
graphs with respect to some sets of rational quotients 2%) = {®y}q, respectively 
Bo = {Hg}4, and let a proper morphism © : 4 — of level 6 := dy be given. 


(1) We say that ® is compatible with rational quotients if there exist ample subsets 
2A C Ao and B C Bo satisfying the following: First, Y and # are geometric- 
like with respect to 21, respectively %. Second, for each Wg € ® there exist 
®, € A and an isomorphism Pyg : Gy — Hp such that the following diagram 
is commutative: 


(*) {22 | 
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(2) We observe that in the above context, for every Wg € % there exists a unique 
®, satisfying hypothesis (*). Indeed, if ®, together with ®yg also satisfy 
hypothesis (*), then Ag, = bop (Ax), and therefore we get 


n~ 


Aa = ba(Ay,) = u(bap(Axg)) = O(¥p(Axq)) = (Ap). 


Since the same is true correspondingly for a’, we finally get Ae = b(A A= Ag! 
But then by Definition 33 (1) (4) it follows that 6, = ®y, as claimed. 

In the above context, we say that a corresponds to B if the aes (*) is 
satisfied for Wg and ®,. Thus a corresponds to f if and only if dA A= 


Proposition 35. In the above context, let ® : G +> be a level-6 proper 
morphism of geometric-like abstract decomposition graphs which is compatible 
with the rational quotients 2 and 8, where 5 := 6. Then © is divisorial. 


(1) More precisely, let b : Aye — Ay be the Kummer homomorphism of ®. Let 
Asg be an arithmetical lattice for # defined by 8. Then there exists a unique 
arithmetical lattice Ay for G defined by A such that (As) C Ay, and one 
has 


$(Axs) = (Ay) O Ag. 

(2) Suppose that $(As) Gc Aa, and for each a and B consider the unique 
divisorial lattices A yy, C Ax and Ag, C Ag, such that Ag := op (Avg) = 
bp(Axy) N Ags and Ay := bu(Ag,) = = ba(Ag,) A Ag. Then for all a, B it 
follows that ®y € A corresponds to Vg € &B if and only if bup (Avg) = Ag, 
and $(Ag) = A 

Proof. It is clear that assertion (2) follows from assertion (1) and previous discus- 

sion. Therefore we will concentrate on the proof of assertion (1). 


First recall that by Definition/Remark 34 (2), we have that a corresponds to B if and 
only if dA A= Ag. Using this we deduce the following: 


- @ maps Ag := dies Ag into Ag := Ye,ea 4 

Let Ass and Ay be fixed arithmetical lattices of .#, respectively Y. For a given 
B, choose @ corresponding to it. By Definition/Remark 31 (3) above, and with 
the notation from there we have that there exists a unique divisorial lattice Ag, in 
Ag, such that by maps Ag, into Ay, and actually bu (Ag,) = Ag A Ag. And 
correspondingly, the same is true for B, i.e., there exists a unique A, in A Hs 


such that Wp( Ax) = Ag M Ags. 


Since @ corresponds to £, with the notation from Definition 34, let bop be the 
Kummer isomorphism defined by ®,g. Then bap (A) is a divisorial lattice in 
Ay,. Thus there exists an ¢-adic unit éyg such that 


oup(Ax,) = Eap- Ay, - 
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On the other hand, the commutativity of the diagram (*) from Definition/Remark 34 
translated in terms of Kummer homomorphisms means that the above equality is 
equivalent to the following: For all 6 and its corresponding @ one has 


(ap) b(Ag) = €ap * Aw - 


Let 6 and f’, and the corresponding a and a’ be given. Hence d maps A p and A B’ 


isomorphically onto De respectively A p’. Since Y is geometric-like with respect to 
the family of rational projections 2, it follows that there exists some multi-index 
of Y which has the properties (j), (jj), of Definition 33 (2). 

Before moving on, we recall that by Fact 8 (2), the fixed divisorial U @g-lattice 
Ag of Y defines uniquely a v-residual Ug, -lattice Ag, by setting 


Ag, := Ug, + jo(Ag NU»). 


We further remark that condition (j) from Definition 33 (2) implies that 
®,(T,) = 1. Hence ®, gives rise to a residual morphism ®yy : G > G. And 
if Den : Ay, — Ag, is the Kummer homomorphism of ®yq, then Jy ° bu = boa: 
Therefore we have 


Jo(Aw) = GralAg,), Jv(Aa) = bral Ay,). 


Now since @, is divisorial, ®yq is so by definition. Hence by Remark 28, 5), we 
have: 


$oa(Ag,) = bora(Ay,) N Ay, - 


Thus combining the assertions above, we finally get 
Jo(Aa) = Jo(Aa) 0 Ag, - 


On the other hand, both a and a’ satisfy condition j) from Definition 33 (2). Hence 
by symmetry, the equalities above hold correspondingly for a’ too. And since by 
condition jj) of Definition 33 (2), one has bva(Ay,) = = es “er = boa! (Ae /), we get 


(a) pA = Aine A Ag, = jolie): 


On the other hand, since @ is proper, there exists some to corresponding to v. Recall 
the second diagram in Remark 26 (3), from which we bring forward 


U to 2, A Foy 
1 Joo 
Jo ~ 


Us —_ Ag, 
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and recall that dé, bo are injective. Since Ag = b(Ag, ), Ay = o(Ag,,), we get: 
(c) Ag, Ap ime 
(d) Ag and Ag’ are mapped by Jr : U ie > Age injectively into A #é,, and have 


to 


equal images Jo (Ap) = Aw ,pp’ = Ivo (Apr). 


And note that bv maps Awp,gg’ isomorphically onto Ap oo’. Then going through 
the same steps as above and using notation correspondingly, we get as above 


(B) Jr (Ag) = Jwo(Ap’). 


We conclude the proof of the proposition as follows: For 8, 8’ and a, a’ correspond- 
ing to them, with the notation from above, we have by relation (a) above, 


cop Aa = (Ap) and eqrpr- Ag = b( Ap) 


for some -adic units gg and €y’g. Applying Jp» to the above equalities, and taking 
into account that by the commutativity of the diagram above one has Jy of = bv ° Siw 
on Uw, thus on Ag, Ag C Ow, we finally get 


Jo(Eap * Aa) = Jvo($(Ag)) = (iv 0 $)(Ag)) = (bv © Jw)(Ag)) = bv (Jw (Ap)) 
and correspondingly 
Jo(Ea’p’* Aa’) = Jo($(Agr)) = (10 0)(Agr)) = (Gv © Jw) (Agr) = Go (Jro(Ap’))- 


On the other hand, jm(Ag) = Jw(Apgr) by remark (8) above; hence the last two 
terms of the equalities above are equal. Thus we get 


Jo(Eap * Aa) = Jo(Ew pr Aa’), hence ag - Jo(Aw) = €a’p’ + Jo( Aa’). 
On the other hand, jy(Ag) = Jv(Aq’), by equalities (w) above. Thus finally 
Eup * Jo (Aa) = Eq/p’ * Jo (Aa) . 


Next recall that if us : Ay, > Ag, is the Kummer homomorphism of the residual 
morphism ®yy : % > GY, then we have jy(Ay) = boa(Ag,), and the latter is a 
Uy, -sublattice of Ag, . Hence finally éog /€q’g’ must be a rational €-adic unit. Since 
B, B’ were arbitrary, we conclude that for every fixed Bo and the corresponding 
Qo, after setting € := &g)p), one has H(A) = é- Ay. Equivalently, b maps 
As = Dip Ag into e- Ag = &- Dog Aa Oo 
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5 Morphisms arising from algebraic geometry 
5.1 Morphisms 


Let k and / be algebraically closed fields of characteristic 4 £. Let K|k and L|I be 
function fields, and let 
1: LILO K\k 


be an embedding of function fields such that / is mapped isomorphically onto k, and 
K|i(L) is a separable field extension; see e.g., Lang [18] for a thorough discussion 
of this situation. 

As defined in the introduction, let A and G'" be the total graphs of prime 
divisors on K, respectively on L. Thenz gives rise in a canonical way to a morphism 
of the total prime divisor graphs 


Q; : GS = a 


The precise definition of ¢, is as follows: First let v be a prime divisor of K|k. Then 
either the restriction vz := v|, of v to L|/ is the trivial valuation wo of L|/, or vy is 
a prime divisor of L|/ otherwise. In both cases, 7 gives rise to an embedding of the 
residue function fields 

ty: Lyyp|lo Kk. 


Inductively, we deduce from this that if v = v, o---o vy; is a prime r-divisor of K|k 
as defined in the Introduction, then to := v|z, is a prime s-divisor of L|/ for some 
non-negative integer s < r. Moreover, by general valuation theory, it follows that 
every generalized prime divisor of L|/ is the restriction of some generalized prime 
divisor of K|k; hence ¢, is surjective, etc. 

The situation will become clearer after we analyze in more detail how geometric 
prime divisor graphs Dx of K|k behave under ¢,. 

First, observe that if K|:(L) is finite, then for every generalized prime divisor 
to of L|/, its fiber is finite of cardinality bounded by [K : 1(L)]. From this one 
immediately deduces that the image of every geometric decomposition graph for 
K|k under ¢, is a geometric decomposition graph for L|/, etc. 

Therefore, let us assume from now on that K' |: (L) is not algebraic. Then denoting 
by K;,|k the relative algebraic closure of 7(L) in K, we have that K,|:(L) is 
finite separable, and K|K, is a regular function field extension. The situation of 
L|l — K,|k was explained above. Thus mutatis mutandis, let K|z(L) be a regular 
field extension. 


Lemma 36. Let X be a projective normal model for K|k, and D D> Dy a set of 
prime divisors with D\ Dy finite. Then there exist a projective normal model X for 
K|k and a dominant k morphism ¢ : X — X such that D C Dg; hence D is 
geometric. 


Proof. Clear. Oo 
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Using the lemma above, we have that there exist projective normal models 
X — k for K|k such that Dy contains the l-edges of Zr and X is complete 
regular-like. And correspondingly, the same holds for L|/ and Y. On the other 
hand, the regular embedding of function fields: : L|/ + Kk is the generic fiber of 
a dominant rational map f : X --+Y which factors through: : / — k. And note 
that since X and Y are normal, f is defined at all points x; of codimension one of 
X. Moreover, replacing X — k by a properly chosen blowup, and normalizing 
the resulting k-variety, we can suppose that f : X — Y is a k-morphism of 
projective normal varieties. And since K |1(L) is a regular field extension, it follows 
that f : X — Y has geometric generic integral fibers. Hence by the characterization 
of (the dimension of) the fibers the following hold: 


¢ Atalmost all points x; of codimension one in X, f (x1) is either the generic point 
of Y, or y) = (x1) is a point of codimension one of Y otherwise. 

* On a Zariski open subset V C Y, the fiber X, at y € V is irreducible, and if 
Xs C X is the Zariski closure, one has the following: 


codim(y) + dim(X y) = dim(X). 


Hence for almost all points y; of codimension one in Y, the closure of the fiber ve 
is irreducible and has dim(X y,) = dim(X) — 1. Equivalently, a is a Weil prime 
divisor of X, and its generic point x; has codimension one in X and is mapped to y;. 

In birational terms this means the following: For every prime divisor v = v,, € 
Dy let w := v\_ = Q,(v) be its restriction to L. Then the center of w on Y is 
y, = f(%1), and one of the following holds: 


(a) wis the trivial valuation of L|/. This is so iff y; is the generic point of Y. 
(b) wis a prime divisor of L|/. Then either y; has codimension one in Y, and if so, 
then w is the Weil prime divisor defined by y,, or y; has codimension > 1. 


In particular, we see that the following hold: First, all w € Dy have preimages 
vin Dy, and for almost all w the preimage v is unique. Second, there are at most 
finitely many “exceptional” v € Dy for which ¢,(v) does lie in Dy. Let 3’ be that 
set. 

We now claim that for the given projective models X¥ — k and Y — / as above, 
there exist quasi-projective normal models X —k and Y > / dominating X > k 
and Y — /, anda morphism f : X > Y above f : X — Y, and having the 
following property: 


(*) gi (Dz U {vo}) = Dz U {wo}, 


where vo and wo are the trivial valuations. In particular, Dy C Dg and Dy C D;. 
Indeed, if g,(Dx U {vo}) = Dy U {wo}, Le., if the exceptional set X’' is empty, 
then there is nothing to prove. Hence consider some v := vx, € 2+ such that 
the center y, of w = g,(v) has codimension > 1. Let Y, C Y be the closure of 
y, in Y. Setting Y; := Y, and Z; := Y,, we consider a sequence of blowups 
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+++ VYn41 > Y, >... as follows: Z, C Y,, is the closure of the center of w on 
Y,,. We stop if Z, has codimension 1, and blow up Z,, otherwise. Then the above 
sequence is finite. Moreover, if codim(Z,,) > 1, then Y,4, — Y, is an isomorphism 
outside Z,,. But then if the process above stops say at Y,,, it follows that Z,, is the 
center of w on Y,,, and codim(Z,,) = 1. An easy Noether induction shows that one 
gets models Y’ dominating Y such that y,(Dy) € Dy, U {wo}. On the other hand, 
f : X + Y can be interpreted as a dominant rational map fd : X --+Y’. Since X is 
normal, and Y’ is complete, f is defined at all points v ¢ Dy and maps these points 
into Dy by the discussion above. To conclude, let Sy C Y’ be the Zariski closure 
of the (finite) complement of D;,\g,(Dx), and Sy the preimage of Sy under /. 
Finally, set Y := Y’\Sy, and X = U(f)\Sx, where U(f) is the the domain 
of f. Then by the choices made, it follows that f defines a dominant morphism 
f : X — Y which has the required property (*). 

Now using the fact (+) above and proceeding by induction on the transcendence 
degree of the residual function fields Lol] — Kov|k, a straightforward Noether 
induction argument shows finally the following. 


Proposition 37. In the above context, let 9x C GS and F, Cc G' be geometric 
graphs of prime divisors for K|k, respectively L|l. Then there exists a unique 
maximal geometric subgraph By. C Dx such that y, defines by restriction a 
morphism of graphs of prime divisors 


1: Fe >. 


Moreover, for given geometric graphs x C Ae and A, C GF" as above, there 
exist geometric graphs of prime divisors Be D De and G D G, for K\k, 
respectively L|l, such that y, defines by restriction a surjective morphism of graphs 
of prime divisors 

1: De > Z@. 


Using Galois theory and decomposition theory of valuations, the above facts 
have the following translation in terms of abstract decomposition graphs: Let 
i’: L’ + K’ bea prolongation of 1 : L|] > K|k to L’, and let 


®, : [IK > TT, 


be the corresponding canonical projection of Galois groups. Then since 
1: L|l + Kk is a morphism of function fields, it follows that the relative algebraic 
closure L; of L|/ in Kk is a finite extension of L, thus a function field over /. But 
then it follows that ®, is an open homomorphism. 


Moreover, if y,(v) = w, and v’ is a prolongation of v to K’, then the restriction 
w’ of v’ to L’ satisfies, first, that w’ is a prolongation of w to L’. Second, let T, C Z, 
and T,, C Zy be the corresponding decomposition groups. Then @,(Z,) C Z,, and 
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®,(T,) C T, are open subgroups. (This discussion includes the case that w is the 
trivial valuation of L.) Moreover, if w is non-trivial, then wL C vK has finite index 
e(v|w). Hence we have commutative diagrams of the form 


L > whLCwLh= Hom(T7,,, Ze) 


|: ler vw) 


K —> vK CvK = Hom(T,, Ze) 


Therefore, if y,, and y, are the unique positive generators of vK, respectively 
wL, then y, is mapped to e(v|w)- y,. Thus if t € TJ, and t, € Ty, are 
the arithmetical inertia generators as defined/introduced at Remark 19 (2), then 
from the commutativity of the above diagrams and definitions it follows that 
@,(t) = 1), 

Now combining these observations with Proposition 37 above and Remark 28, 
especially (5), we obtain the following by merely applying the definitions: 


Proposition 38. In the notation from Proposition 37, the embedding of function 
fieldsi : L|l — Kk and the resulting canonical homomorphism ®, : Ix — I, 
give rise in a natural way to a level-td(L|!) morphism ®, : Ga, — Ga, of the 
corresponding abstract decomposition graphs. 


(1) Moreover, if p, : Be — GY is a proper morphism of graphs of prime divisors, 
then the corresponding ®, : Gj, —> Gg, is a proper morphism of abstract 
decomposition graphs. 

(2) Further, if ®, : Ga, — Gg, is proper, and both G, and Gy, are complete 
regular-like, hence divisorial by Proposition 23, then ®, is divisorial. 

(3) The Kummer homomorphism b: Lr of ® is actually the €-adic completion 
of the embedding of function fields 1 : L|l — K\k. In particular, 1 defines ®, 
uniquely. 

(4) Moreover, ®, defines 1 uniquely up to Frobenius twists. 


Proof. Assertions (1), (2), and (3) follow from the discussion above. 


To (4): Recall that in the Introduction we considered an identificationix :Ty « > Z¢ 
of the ¢-adic Tate module of K with Z,, and via that identification one gets 
the identification K = Homeont (7x, Ze). Explicitly, this identification works as 
follows: For each x € K™%, let 6(x) : Tx — Tex be the corresponding 
character defined in Kummer theory. Then 6, := 1x 0 6(x) is the homomorphism 
6. : Tx — Ze» defined by x. Given the embedding: : L|1 — Kk, by the functo- 
riality of Kummer theory one has 5(1(y)) = 1 0 5(x) o ®. Therefore, if we choose 
the identifications 1x : Ten — Ze,iz : Te, — Ze compatible with 7, i.e., such that 
1, = 1x01, it follows that one has 4,(y) = 6,0®; hence the Kummer homomorphism 
defined by ®, is 


d :L= Hom(/T,,, Ze) ~ HomUTx, Ze) = K, by > by) 


and therefore, b is exactly the £-adic completion of the embedding: : L* > Kk”. 
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Now let 7’ : L|/ <> K|k be a further embedding of function fields such that 
@, = ®,. Then choosing 1 : Te, — Ze such that 1), = 1x 01’, it follows that the 
Kummer homomorphism # of &, = ©, in this new setting is the £-adic completion 
of 1’. On the other hand, there exists an £-adic unit ¢ € Ze such that 7 a =e-1,. If 
so, then we have ¢! =e: b on L. Since b is the €-adic completion of 7, and $! is 
the £-adic completion of 1’, if we denote by yx : K* > K the ¢-adic completion 
homomorphisms, we have 


iK(v'(y)) = e- se (iQ), ye L™. 


Therefore, ¢ must be a rational ¢-adic unit, say ¢ = m/n with n,m natural numbers 
relatively prime to ¢. Equivalently, there exists ay € k such that 1/(y) = ayi(y)"/" 
in K, hence 1/(y) is of the form 1/(y) = w”/” in K, as k is algebraically closed. 
But then 1’(y) is an n™ power in K. Since this is the case for all 1’(y) € 1/(L), it 
finally follows that n = p* is a power of the characteristic exponent p of k and /. 
By symmetry, the same is true for m. Hence finally ¢ is a power of the characteristic 
exponent of & and /. Equivalently, 7’ is a Frobenius twist of 1. a 


Before studying the rational quotients in more detail in the next subsection, 
we mention the following weak version of the main result mentioned in the 
introduction. By Proposition 23, if %, is a complete regular-like decomposition 
graph for K|k, the canonical sequence | — O x > AH, > Div(Dk)«@ > 
Clg, — 0 can be recovered from %, up to multiplication by ¢-adic units e € Z7. 


Proposition 39. Let K|k and L|! be function fields over algebraically closed fields 
of characteristic £ £, and ® : Gogo > Gogo be an isomorphism. The following 
hold: 


(1) For every geometric decomposition graph Ga, for K|k there exists a geo- 
metric decomposition graph Gg, for L|l such that ® defines an isomorphism 
Gx, —> Go,, and Go, is complete regular-like (hence abstract divisorial ) iff 
Gq, is so. 

(2) Let ® : Gx, —> Gg, be an isomorphism as above, Gy, and Gg, be complete 
regular-like decomposition graphs with sets of 1-edges Dx and D,. Then © is 
divisorial, and there exists ¢ € ZF such that the Kummer isomorphism b of B 
makes the diagram below commutative 


“~ div 
(+ Ug 345 Dv Ope te. = 0 


{ed {ed |sdive {e-can 
di 


0+ Tea 15“ Dh y—eG, 0: 


Proof. Assertion (1) follows immediately by sorting through the proof of Propo- 
sitions 22, as the group-theoretical recipe given there is invariant under group 
isomorphisms. Assertion (2) follows immediately from assertion (1) above, com- 
bined with Propositions 23 and 30. oO 
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Next we turn our attention to rational projections of abstract decomposition graphs 
Gm, as above. Let t € K be an arbitrary non-constant function, and let x; be the 
relative algebraic closure of k(t) in K. Then x;,|k is a function field in one variable. 
We endow x;|{k with its unique complete normal model X, — k, which is also 
projective, and consider the corresponding graph of prime divisors Y, for «; and 
the resulting complete regular-like decomposition graph Y,, for [7,,. Then %, has 
level 6 = 1 and is divisorial, by Proposition 23. Moreover, if g; is the geometric 
genus of X;, then we have: 


(a) Ely, & Ze. 

(b) Og, = ZS as the €-adic dual of 17; (X;) = ZS thus g; is encoded in %,,. 

(c) The canonical (surjective) projection ®, : Hx — TI, defines a level- 
one morphism of abstract decomposition graphs ®, : G —- %,, by 
Proposition 38. 


Before going into the details of characterizing the rational projections, let us 
mention the following fact for later use: 


Proposition 40. With the above notation, suppose that k; —> K is a regular field 
extension, i.e., K is separably generated over k;, and K 1 kK; = k;. For normal 
models X and X; of K\k, respectively of k,|k, let f : X --> X; be a rational map 
defining Kk; — K. Then: 


(1) There exists on open subset U C X;, such that the fibers f, : Xy-->k(s) =k 

of f ats € U(k) are integral, and the generic point x, of Xs is the center 

of the unique prime divisor vy € Dy which restricts to s € X;. In particular, 

vs(K) = vs (kz). 

Let L|K be a finite separable extension of K which is linearly disjoint from K; 

over K, t.e., L 0K; = k;. Then for almost all s € U(k) the prime divisor vs 

has a unique prolongation ws, to L, and moreover, ws|Vs is totally inert, i.e., 

Lw;| Kvg is separable and |Lws : Kvs] = [L: K]. 

(3) Let AC K be a Zy-submodule of finite corank such that AN k; = 1. Then 
for almost all s € U(k) one has that A C Uy, and jy, maps A injectively 


(2 


wm 


into Ky. 


Proof. To (1): Since K |x; is a regular, it follows that the generic fiber f., : Xx, --=> Ki 
of f is geometrically integral. Hence the fiber f, : X; — «(s) of f is geometrically 
integral for s in a Zariski open subset s ¢ U C X;. The remaining facts are just the 
(valuation-theoretical) birational interpretation of this fact. 

To (2): This is just a souped-up version of assertion (1), using the fact that since L| K 
is separable, the fundamental equality [L : K] = )°, e(w;|v) f(wi|v) is satisfied for 
every discrete valuation v of K and the set of its prolongations w;|v to L. 

To (3): Choose some projective normal model X — k of K|k such that the rational 
map X --- X;, is defined on the whole X, and IT) x = IT,,p,. Then K*/x* embeds 
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in the divisor group Div(X,,) of the generic fiber X,, — x; of X — X;. Hence 
K*/k; is a free abelian group, and we have an exact sequence of free abelian 
groups 

1 KX/k* > K*/k* > K*/ KX > 1, 


and its £-adic completion 1 > & > K > KH > 1, where # := KX] Ke. 
Note that since AM kK; = 1 by hypothesis, the map K > XH is injective on A. 
For n = €°, consider the exact sequence 1 > k7X/n > K*/n > #/n > 1, 
and let A, C K%*/n be the image of A in K*/n. Then A is the projective limit 
of (A,)n- Further, setting &, := A, M (k//n), the projective limit of (3), equals 
ANk; = 1. Hence for every 7g there exists n > no such that the image of 5, > Fy, 
is trivial. 

The Kummer theory interpretation of the facts above is: Let Ky, := K[X/Ay | 
be the corresponding Z/n elementary abelian extension of K. Then Gal(K,,|K) 
is isomorphic to Hom(A,, 4,), and setting x, := K, M kK one actually has 
Kn = K;]/&, ]. And further, Gal(«,|«;) is canonically isomorphic to Hom(), [dn), 
and the canonical projection Gal(K,|K) — Gal(k,|k;) is given by 
Hom(A,,, Un) — Hom(%,,, “,), which is defined by the inclusion 5, — A,. In 
particular, setting M, := Kk,, it follows that K,|M,, is a Z/n elementary abelian 
extension with Gal(K,,|M,,) canonically isomorphic to Hom(A,,/&,, [n). 

Now recall that A is the projective limit of (A,,),; hence for no sufficiently large, 
the map A > A/¢ factors through A > A,,. Second, for any fixed no, ifn > no 
is sufficiently large, the image of 4, — &,,, is trivial. Therefore, the canonical map 
A, — Arn, factors through A,/&,; and therefore, the canonical map A > A/€ 
factors through A,,/&,,. We conclude that if 6 > 0 is the rank of the finite free 
Ze-module A, i.e., A X Z°, then Gal(K,, |M,,) has (Z/£)° as a quotient. 

In order to simplify and fix notation, for n > mg as above, set M := M,, 
L := Ky, and k := ky; hence M = Kx and L|k is a regular field extension, 
because kK, = K,k;. And denoting by Ay C M%/n the image of A in M*/n, one 
has L = M[X/Ay ], and in particular Ay X A,/&, by the fact that Gal(L|M) is 
canonically isomorphic to both Hom(A,,/ 4, 4) and Hom(A y, 2, ). In particular, 
Aw has (Z/£)° as a quotient. 

Changing gears, let Z; — X, be the normalization of X; in the function field 
extension kK; <> kK, and Z — X the normalization of X in the field extension 
K — M. Then the morphism X — X; is dominated by Z — Z,, and the following 
holds: Since M |x is a regular field extension, the generic fiber of Z — Z, is 
geometrically integral. Therefore, there exists an open subvariety V C Z; such 
that for all s € V(k), the fiber Z, > x(s) = k is integral. Consequently, the prime 
divisor v; of M defined by the Weil prime divisor Z,; C Z is the unique prime 
divisor in Dz which restricts to the point s € V(k). 
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Further, let Y — Z be the normalization of Z in M <> L. Then arguing as 
above, it follows that for almost all s € V(k), the fiber Y, > x(s) = k of Y > Z, 
at s € V(k) is integral, and the prime divisor w, of L|k defined by the Weil prime 
divisor Y, C Y is the only prime divisor in Dy which restricts to the point s € V(k). 

But then w, must restrict to vs too, and moreover, w, is the only prolongation of 
vs from M to L and ws|vs is inert. By the fundamental equality we conclude that 


Gal(L|M) = Z,,,\,, > Gal(Lw;|Mvs) 


is an isomorphism. By general valuation theory one has Lw,; = Mv,[X/Amy;], 
where A yy V; is the image of Ay C M%*/n under the residue map U,,/n > Mv,/n 
induced by j,, : U,, — Mvs. By Kummer theory applied to both L|M and 
Lw;|Mvs, we conclude that Gal(L|M) — Gal(Lw;|Mv,) is an isomorphism iff 
Am — Amys is an isomorphism. From this we finally conclude that (Z/£)5 isa 
quotient of Ayv;. Therefore from the commutativity of the diagram of surjective 
morphisms 


A > A, —~ Am 


be dm 


JA) > Anv—> Amys 


it follows that 7,(A) has (Z/£)° as a quotient, because Ay v,; does so. But then since 
Az Zi, and 7,(A) has no torsion, being a submodule of the torsion free Z~-module 


Ry, it follows that Jv maps A isomorphically onto 7,(A). al 


Proposition 41. Let %, be a complete regular-like decomposition graph, which we 
view as a divisorial abstract decomposition graph, as indicated in Proposition 23. 
Then with the above notation, and that of Definition/Remark 31, fort € K the 
following are equivalent: 


(i) Dy, : Go > G, is a rational quotient of G,. 
(ii) «; is a rational function field. 


Proof. To (i) => (ii): First recall that Og, ~ as where g; is the genus of X;. 
Hence Y,, is rational if and only if g = 0, or equivalently, x; is a rational function 
field. 

For (ii) => (i), we first claim that Y,, is rational. Indeed, by the discussion above, 
U %, = 0 and Ely, = Zp, thus the claim. Next we claim that ®, : G, > %, 
defines Y,, as a quotient of Yy. Indeed, first ®,., : x — Ik, is surjective by 
the definition of «;. Thus it is left to show that ®,, is proper, i.e., to show that if 
Py 1% —> G, is a level-one residual morphism for ®,,, then the following hold: 


(a) Each 1-index of Y% is mapped under ®, to some multi-index of %,,. 
(b) Each multi-index of Y,, corresponds to some multi-index of %. 


To prove (a), let X, — k be a normal model of Ky|k such that Dy, is the set 
of all the 1-vertices of Y,. Then the embedding of k function fields 1 : kK; > Ky is 
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defined by some dominant rational map f : Xy --> X;. Since Xp is normal and X; 
is complete, it follows that f is defined at all points of codimension 1. This means 
that for every v € Dy,, if v is trivial on x;, then f(v) is the generic point of X;, and 
hence v is mapped to the trivial valuation of Y,,; and if v is non-trivial on «;, then 
J (v) is a closed point in X;. 

To prove (b), we proceed by induction on dy = td(K,y|k). If dy = 1, then 
X,y is a normal curve. Since %, was assumed to be divisorial, %, is divisorial 
too by definition. Hence by Proposition 23 (1), Xy — & is a complete normal 
curve. But then the dominant rational map f : X, --= X; is a surjective morphism. 
Finally, if dy > 1, then there exist “many” v € Dy, which are trivial on x;. But 
then ®, gives rise to a level-one residual morphism ®, : %, > Y, of %. Since 
td(Kv|k) < dy, by induction ®, is proper. On the other hand, the set of multi- 
indices %w, of Y, is contained in the set of multi-indices %,. Hence finally, every 
vertex of %., corresponds to some vertex of Gp. 

Finally, it is left to check properties (i), (ii), Remark/Definition 31 (2). 

Checking property (i) form Remark/Definition 31 (2): If a(Uy M ik) is non- 
trivial, then 7» maps K; injectively into g.. Indeed, let ) = v, o-++0 vy, with v; 
prime divisors. Then if » is not trivial on is then x, = k, and hence Jy is trivial 
on kK; Cx Second, if is trivial on «75 then k;b = K;; hence KC U», and Jy is 
injective on &;. 

Checking property (ii) form Remark/Definition 31 (2): Let us view K as a 
function field over the rational function field x;; hence td(K |k,;) = td(K|k) —1 > 0, 
and xk; is relatively algebraically closed in K. Moreover, after replacing x; by a finite 
purely inseparable extension (which is of the form «, with y” = t for some power 
p* of p = char(k), which does not change the Galois theory of the situation), we 
can suppose that K |; is actually a regular field extension. Let ¥ — k be any normal 
model of K|k, and let Pe be the projective t-line over k. Since K |x; is regular, by 
Fact 40 there exists a cofinite subset S C &k such that the prime divisor v, of K|k 
defined by the fiber X, C X at the point s € Pr defined by a € S restricts to the 
(t — a)-adic valuation of «;, and ¢ — a is a uniformizing parameter of v,. Now recall 
that K |x; is a (regular) function field over x, with td(K |x,) = td(K|k) — 1 positive, 
and one has an exact sequence of the form 


loki K* > K*/KX > 1. 


And note that the last group is a free abelian group, since it is contained in the group 
of Weil prime divisors of any projective normal model Y — x; of K |x;. Therefore, 
Khas complements, say .# C K™ in K™, and hence we have K* = «;*- # with 
HX a free abelian subgroup with “ M «* = {1}. Moreover, we can “adjust” “~ 
in such a way as to have v,(.%) = 0 for all a € S as above. Indeed, if (a;); is a 
Z-basis of %, then replacing each a; by f; := a; [[,e5(t — a)" (i), the resulting 
system (f;); generates freely a Z-submodule .“ of “% such that v,(.%) = 0 for 
alla € S, and .% is acomplement of «* in K™. 
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Therefore, we may and will suppose that .“ C K™ isa complement of « in K* 
such that vg (4) ) = 0, or equivalently % C U,,, for alla € S; and taking t-adic 
completions, KH isa complement of ; in K with the same property, i.e. hb aaie Va 
forallae S. 

Now let A Cc K fin be a Ze-submodule of finite corank, which means that 
v(A) = 0 for almost all v € D Dx. In particular, this implies that A C in _ for 
almost alla € S. Let A; C H and A>. C k;, be the projections of A on 2%. 
respectively K;. We claim that both A; and Aj have finite corank, i.e., v(A,) = 0 
and v(A2) = 0 for almost all v. Indeed, let &’ C Dy be the finitely many v € Dy 
such that v|,, is non-trivial, and v # v, for alla € S. Every x € A has a unique 
presentation of the form x = x;x2 with x; € Aj;, and clearly, v(x) = v(x1) + v(x) 
for all v. Then for v € Dy\& satisfying v(A) = 0, since v(x) + v(x2) = v(x) = 0, 
we must have v(x,) 4 0 iff v(x2) 4 0 for v € Dy. On the other hand, if v(x2) 4 0, 
then v|,, is non-trivial; hence v = vg for some a € S, by the fact that v € Dy\JZ. 
And if v = v, for some a € S, then W(#) = 0; hence v(x,) = 0. We conclude that 
forv € Dy\» we have v(A) = 0 iff v(A;) = 0 fori = 1,2. Thus both A; and A> 
have finite corank. 

Now since A; 1 &; = 1, it follows by Proposition 40 (3) that 7,, maps A, 
injectively into Kv for almost all a € S. Further, we notice that j,,, is trivial on 
Ap for almost all a € S. (Indeed, f € x; is a vg-unit iff a is neither a zero nor a 
pole of f; and if so, then y,,(f) = jv,(f(@)) = 1, because j,,(f(a)) € k*, and 
Jv, is trivial on k*.) Therefore, for x = x\x_ € A with x; € A; as above, one has 
Jvqg(X) = Jv, (%1). Hence for x € A as above we have j,,,(x) = 1 iff y,,(%1) = 1 iff 
x € Ao, as claimed. oO 


Va? 


Notations 42. We introduce notation as follows: 


(1) Let {k, = k(x) | x general element of K} be the set of all the subfields of K 
generated by general elements x € K. Note that k, = x, if and only if x’ is a 
linear transformation x’ = (ax + b)/(cx + d) of x. 

Further, let 2x = {®,,},, be the set of the corresponding rational quotients of 
4%, and note that ®,, = ®,,, if and only if x’ = (ax + b)/(cx +d) isa linear 
transformation of x. 

In order to simplify notation, we identify «, with the corresponding subfield of 
K. This identification defines a canonical embedding kK, > K which turns out 
to be the inflation map defined by the canonical projection ®,. : Zk > Ik,. 
Therefore, the £-adic completion homomorphism jx : K* > K then identifies 
J, (KX) with jx (K*) inside RK. 

Letz : L|l — K|k be an embedding of function fields such that 1(/) = k and 
K|.(L) a separable field extension. Let Ax = {®,.},, and By, = {WH}, the 
sets of all rational quotients of K|k, respectively L|/. Finally, let 8, C B, be 
the set of all Y%., such that 7(«,) is relatively algebraically closed in K. Thus in 
the context of Proposition 38, by taking into account Fact 41, for YW, € 8, and 
the corresponding ®,, € 2x, one has commutative diagrams in which Dy, 18 
an isomorphism: 


(2 


wa 


(3 


wm 
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IIx — II, Gy, —> Go, 
{Pex [Yr and | 2«x [Yes 
Pxxky Pry 
TT, kK, —_? TT, Ky G xo Ge y 


Fact/Definition 43. In the context from Notation 42 above, the following hold: 
Birational Bertini: Let x,t € K be algebraically independent over k, and let x be 
separable in K, i.e., x is not a p-power in K, where p = char(k). Then for all but 
finitely many a € k, the element ax + t is a general element of K, i.e., k(ax + ft) 
is relatively algebraically closed in K; see e.g. Lang [18], Ch. VII, Lemma in the 
proof of Theorem 7, or Roquette [32], §4. 


(1) We will use the above “birational Bertini” repeatedly in the following form: Let 
x,t € K be fixed algebraically independent functions over k, with x separable, 
e.g., general. Then the following hold: 


(a) ty := ax +t isa general element of K for almost alla € k. 

(b) tata = t/(a'x +a) is a general element of K for all a’ € k* and almost 
alla ek. 

(c) tatata = (a"t t+a’x +a4+1)/(t + a’x +a) isa general element of K for 
alla” €k and almost all a’,a € k. 


(2) For x,t € K as above, the general elements of the form fg, tga, ta'a'a, 
will be called general elements of Bertini type defined by x,t. Further, a set 
2S) C K~* will be called a Bertini set, if for all x,t € K which are algebraically 
independent over k, and x is separable, one has fg, te’a,tatata € & for all 
a” €k, and almost all a’,a € k. Clearly, © generates the multiplicative group 
K™~ by assertion (1) (b) above. 

We say that a set of rational quotients 2 C 2x is of Bertini type, if 21 has a 
subset of the form 25 := {®,, | x € X’} for some Bertini set & C K*. 
Next letz : L|] — K|k be an embedding of function fields such that 1(/) = k 
and K|1(L) separable. Then for every separable element y € L, one has that 
xX := 1(y) is a separable element of K |k. Further, directly from the definition of 
a general element of Bertini-type one gets the following: Let up, upp, Uap» € L 
be general elements of Bertini type defined by some y,u € L. Then for all 
b” € I and almost all b’ b ¢€ 1, the images ty := 1(yp), tore := t(Upty), 
th'bh = 1(Uprpy) are general elements of Bertini type in K|k defined by 
x :=1(y), t:=1(u). 
From this we deduce that there exist Bertini sets A C L* and © Cc K™* such 
that 1(A) C &. Therefore, for the corresponding Bertini-type sets of rational 
quotients %, and 25, we have that if kK, € By, then Kk, := 1(Ky) lies in 
As, etc. 


(3 


wm 


(4 


ww 


Proof. The only assertions which are perhaps not obvious are (1) (b) and c). 
To (1) (b): fara is general if and only if 1/tg, = a’(x/t) + a(1/t) is general. 
Now note that if x/t,1/t € K are algebraically independent over k, and because 
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x is separable, it follows that at least one of the two elements is separable. Finally 
apply the “birational Bertini’. 

To (1) (c): Leta := 1—a”. Then tgqqg = a" + (aa’x + aa 1]/(t +a’x +a) 
is a general element if and and only if ¢t’ := (aa’x + aa 4+ 1)/(t + a’x +a) 
is so. Note that tf + a’x +a is a general element for all a € k* and almost all 
a’ € k by the “birational Bertini.” Hence if w = 0, then rf’ := 1/(t + a’x +a) 
is a general element. Finally, if « 4 0, then x’ is a general element if and only if 
1/t' = (t — 4)/(aa'x + aa + 1) + + is a general element, thus if and only if 
(t — +) /(aa'x + aa + 1) is a general element. And the latter is a general element 
for all wa + 1 € k* and almost all a’ € k, by Case (1) (b). Oo 


Proposition 44. With the above notation, the following hold: 


(1) Suppose that td(K|k) > 1, and let GY, be a complete regular-like geometric 
decomposition graph, which we view as a divisorial abstract decomposition 
graph. Then endowing Gx with a Bertini-type set % C AK of rational 
projections, Ga, becomes a geometric like abstract decomposition graph 
satisfying the following: K?)) := jx(K™)®Zw is an arithmetical lattice defined 
by A inside RK, which we call the canonical arithmetical lattice. 

Let1 : L\l — Kl be an embedding of function fields such that 1(l) = 
k, and K\1(L) is separable. Let Hg, be a complete regular-like abstract 
decomposition graph for L|l such that 


(2 


wm 


D, : Gx, > Hg, 


gives rise to a proper morphism of abstract decomposition graphs. Then 
there exist Bertini-type sets & of rational quotients for Hq, such that ®, is 
compatible with the rational projections 8 and . 


Proof. To (1): Let us check that 2{ satisfies the conditions from Definition 33. Let 
X — k be a quasiprojective normal model of K|k such that Dy is the set of all 
1-vertices of Gy. 


Step 1. 2( is an ample family of rational quotients for %,. Indeed, first recall 
that by Fact 43 (2), the set X'qy generates K*. Therefore, with the notation from 
Definition 33 we have Aq, = Ki; hence Ag = Ko too. From this we deduce, 


first, that Ag is £-adically dense in K, as Jx(K”%) itself is so. Second, since Zr was 
supposed to be complete regular-like, for every non-constant x € K there exists 
v € Dy such that v(x) 4 0. Equivalently, for every non-trivial x € K (0) there exists 
ve D> such that v(x) 4 0. But this means exactly that Kip nN Og, is trivial. From 
this discussion, condition (ii) of Definition 33 follows. For condition (i), observe that 
®,, A P, implies that «, A «,. But then kK, 1 Ky = k; hence kK, and ky have 
trivial intersection inside K. 

Step 2. %, endowed with 21 is geometric like. 


Indeed, let «, and x,” be given. If k, = «,’, then there is nothing to prove. Hence 
let kK, A Ky. Since kK, and kK,’ are relatively algebraically closed in K, it follows that 
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x,x’ are actually algebraically independent over k. Therefore, by the “birational 
Bertini,” it follows that for almost all a,a’ € k we have that t := ax —a’x’ gives 
rise to a dominant rational map f : X -->P} such that for general points t = b, 
the fiber X, is a Weil prime divisor of X, and x, x’ are non-constant on X;,. The 
birational translation of this is the following: If v := vy, € Dy is the corresponding 
prime divisor of K, then x, x’ are v-units such that j,(x), y)(x’) are not constant in 
the residue field Kv of v. But then «* and «*, consist of non-principal v-units, and 
are mapped isomorphically into the residue field Kv. Moreover, since t = ax —a'x’ 
has v(t) > 0, it follows that ax = a’x' (modm,), hence j,(k,) = Jy(Kx’). Taking 
£-adic completions, we deduce from this that conditions j), jj) of Definition 33 are 
satisfied at v. 

To (2): Apply Fact 43 (3), (4), and the commutative diagrams from Nota- 
tions 42 (3). oO 


6 Proof of Main Theorem 


In this section we will give a proof of the Main Theorem from the introduction. We 
will actually prove a slightly more general result than the Main Theorem announced 
in the introduction, in the sense that the first part of the theorem proved below 
compares complete regular-like geometric decomposition graphs with geometric- 
like abstract decomposition graphs. 


Theorem 45. Let K|k be a function field with td(K|k) > 1, and let %, be a 
complete regular-like geometric decomposition graph for K|k. We endow Gq, with 
a Bertini-type set 21 of rational quotients, and view it as a geometric like abstract 
decomposition graph. 


(1) Let # endowed with a family of rational quotients 8 be a geometric like 
abstract decomposition graph. Then up to multiplication by €-adic units and 
composition with automorphisms ®, : Go, — Gq, defined by embedding of 
function fields 1 : K|k — K\k such that K\i(K) is purely inseparable, there 
exists at most one isomorphism ® : Gj, — HH of abstract decomposition 
graphs which is compatible with the rational quotients 21 and B. 

(2) Let L|I be a further function field with td(L|l) > 1, and let Ho, be a complete 
regular-like abstract decomposition graph for L|l. We endow Hg, with a 
Bertini-type set 8 of rational quotients, and view it as a geometric like abstract 
decomposition graph. Let 

@: IIx > TT, 


be an open group homomorphism which defines a proper morphism 
® : Ga, — Hg, of abstract decomposition graphs compatible with the ratio- 
nal quotients 8 and 2. Then there exist an €-adic unit ¢ and an embedding of 
function fields 

1: LIl—> Kk 
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such that ® = ¢-©®,, where ®, : Gy, — Hg, is the functorial morphism 
of decomposition graphs defined by 1 as indicated above. Further, 1(1) = k, 1 
is unique up to Frobenius twists, and € is unique up to multiplication by p"- 
powers, where p = char(k) andn € Z. 
Proof. Since (1) follows from (2), it suffices to prove assertion (2). 
Recall that by Proposition 44, Ki ‘= Jx(K*) ® Zw and Loy := Ji(L*) ® Ze) 
are arithmetical lattices for %, endowed with 21, respectively for #, endowed 
with %. Now by Proposition 35, it follows that (Li) is contained in a unique 
arithmetical lattice of %,. Since the arithmetical lattices of %, are ¢-adically 
equivalent to K (o> there exists an ¢-adic unit ¢ such that 


(Lin) Ge-Ki. 


Therefore, after replacing ® by ¢- ®, without loss of generality we can make the 
following hypothesis: 
Hypothesis I. b maps Liy isomorphically into Kip: 


We further recall that 7,(K™*) = K*/k* and j,(L*) = L*/1™ are true lattices in 
K (a) respectively Lip: In order to simplify notation, we denote by 


X= x(x) =k*x, y= yi(y) = ly 


the image of x € K™ under jx, respectively that of y € L* under j,. Further, we 
will always denote elements of K, (2)? respectively of Lo in boldface: 


xe Ki, ye Lo. 


Next we want to remark the following: Let ®,. € 2 correspond to some 
®,, € B, and let Kye) C Ky and Ky(e) C Ky be the unique divisorial lattices such 
that be, (Kx @)) C Ko; respectively bx, (Ky (ey) C Lo: Then by Proposition 35 (2), 
we get Px.xy (Ky) = Kx.(Q> PPK, (Kye) = Prey (Kx). Hence taking into account 
the identifications from Notations 42, i.e., kx (¢) = K, (2) Nok, (Kx) inside K = Adin F 
and ky (¢) = Lo M1 bk, (ky) inside L = A #o,, the above assertion is equivalent to 
the fact that if ®,, € 2l corresponds to Py, € 55, then 


Plky()) = Kx) 


Hence we have IL(k¥) C Kyq and sx(kKX) C kx 2, and the task now is to 


understand the precise relation between b © pi(Ky*) and jx (K~) inside Ky (ey. 


Lemma 46. Let ®,. € 2X correspond to some ®,, € 8. Then there exist unique 
relatively prime and prime to £ integers m,n > 0 such that the following hold: 
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(1) b(n ye (ly +))%) = m- pe (kx +k)% and O(n: Ji (e)) = m- JK (KX) in kyo. 
(2) o(n-Jx(y)) = m- JK (x), provided Zi - jy) is mapped by @ into Z)+ JK (x), 
and such a choice of a generator x for kx is always possible. 

Proof. We begin by considering the systems of arithmetical inertia generators & x, 
of {,, respectively T,, of Y,, as introduced at Definition/Remark 19 (2). We let 
(vy) = w’ —w be the divisor of y € x, = /(y), and (x) = v’ — v be the divisor of 
x € kK, = k(x). Thenx, = k(x), and in the notations from Definition/Remark 31 (1), 
we have: 


(a) Y:= Jz) = Ow — Gv, and A, = A, = jgr(ly + 1)* C ky is the 
generating set at w with respect to Z,. 
(b) X := jx(x) = oy —@, and A = A, = jrlkx +k)* c Kx e) is the 
generating set at v with respect to Z,,. 
Moreover, we can choose x from the beginning in such a way that v’, v are the 
preimages of w’, w under ®,,,,. Equivalently, we have (Zu )'Y) = Z-x. Therefore 
there exist unique relatively prime integers m,n > 0 such that 


(*) d(n-y) =m-x. 


On the other hand, the image ®,,., (&.,,) of L,,, under the isomorphism ®,,.,, is a 
distinguished system of inertia generators for «, such that b( A,,) is the generating 
set at v with respect to D,,., (X.,). By the uniqueness up to €-adic equivalence of the 
distinguished systems of inertia generators we have Px,xy (Ze, ) = T, for a unique 
t-adic unit « € ZX. Hence 6(A,) = e—! - Y,, and in particular, g(y) = e7! - x 
inside «, (e). Then by the fact (*) above, it follows that e = n/m; hence both m,n 
are relatively prime to ¢. Finally, we get 


(x)! d(n- Py) =m: Py. 


Clearly, if m’,n’ are relatively prime integers such that $(n’ - Py) =m'- Px, then 
we must have $(n’ - y) = m’-x. Therefore, (m,n) = (m',n’) by the uniqueness 
of m,n. 

Finally, since A, and Y, generate «*/1* inside ky (e), respectively «/k™ inside 
Kx,(¢), We deduce that for the unique m, n above, one has 


(*)” b(n : IL(ky)) =m- JK(kx). 


This completes the proof of the lemma. oO 


Norming 47. Inthe context of Lemma 46 above, suppose that Ze): JL(¥) is mapped 
by d into Zip) - JK (x). Then we will say that pi is y-normed if d ° JL(y) = JK(X). 

Clearly, a priori, é might not be normed with respect to any ®,, € B and the 
corresponding ®,, € 2. Nevertheless, we can “artificially” remedy this as follows: 
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With the notation from Lemma 46 above, suppose that we have chosen the generator 
x such that Z¢) - 7, (y) is mapped by d into Ze): x(x). Hence we have go IL(Y) = 
(m/n)- x(x). Further, notice that 7 3 :=m /n is an £-adic unit. And replacing the 
morphism ® : [7x — IT, by its 7 F; -multiple &’ := n te @® amounts to replacing db 
by its (1/7ng)-multiple ¢' — (1/n3) . @. In particular, we have nz, = 1; hence OM is 
y-normed. 

Hence we have the following: Let x, € % and its corresponding kK, € 2 be given 
such that Ze) - Jz (y) is mapped by b into Ze) - jx (x). Then after replacing ® by a 
properly chosen multiple 7-® with n € Z, the resulting Kummer homomorphism 
(1/n) - d is y-normed. Hence mutatis mutandis, we can suppose that d satisfies the 
following norming hypothesis: 


Hypothesis ITI. «, € 2 corresponds to ky € B, and ° JL(y) = JK(%), hence d is 
y-normed. 


Remark/Notation 48. If b is y-normed, then by Lemma 46, b defines bijections 
(H) b: lly +X > prky + %, Bb: (@) > Jue). 


We set Mx := b(j1(L*)) N yx(K*), and let M; © j,(L”*) be the preimage of 
Mx under @. Then jr(ky) C Mz and jx(k~) C Mx by the fact (+) above, and 
notice that 

¢: M, > Mx 


is an isomorphism which maps j,(k;) isomorphically onto jx (k~). We will say 
that u € L* andt € K* correspond to each other if the following hold: 


yi) € Mz, gx(t)€ Mg, and poy, (u) = sr(t). 


Finally we notice that M;, ® Zw) = Li inside L. 


Lemma 49. Suppose that t € K andu € L correspond to each other via d. Then 
P= (kt +k)Vk*=jp(kt+k)* C Mx, Ay i= (lutl)¥l*= 7, lu+l)*CM_z. 


Proof. Case 1: u € xy. Thent € xx, and we are in the situation of Lemma 46 
above with m = n = 1, from which the assertion follows. 

Case 2: u ¢ ky. Since k, = /(y) is relatively algebraically closed in L, it follows 
that u, y are algebraically independent over /. Correspondingly, the same is true 
for t,x, i.e., t,x are algebraically independent over k. Then by the Fact 43 (1), 
we have: 


(i) tag 2= t/(a'x + a) is a general element of K for almost all a’,a € k. 
(ii) upry = u/(b'x + D) is a general element of L for almost all b’,b € /. 
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Hence by condition (+) of Remark/Notation 48, we conclude the following: For 
a’,a as at (i), let yarq € (Ly + 1)* be such that db © Jt(Vata) = JK(a'x +a). Then 
by (ii), Uata 1= U/ Vara is a general element of L for almost all a’,a € k. And note 
that 7 

@ ° JL(Uata) = d © JL(Uu/ Vata) = IK (t/(a'x ale a)) = JK (tata). 


In particular, since 21 and 5 contain some Bertini-type subsets, we can suppose 
that Krys, € 2 and Kur € 8, and Kray, corresponds to Kura under ®. On the 
other hand, since by hypothesis we have JK(t) € Mx and L(u) € M_, and by 
Remarks/Notation 48 above, jx (kx +k) C Mx and jz(ly + 1) C Mz, it follows 
that for almost a, a’ € k, the following hold: 


(a) tata € K and ugg € L, respectively tga41 € K and ugra+1 € L, are general 
elements which correspond to each other under @. 
(a)’ Hence ¢ is normed with respect to both ugrq and ug'g+1. 


For b,b’ as at (ii), let xy, € kx + k be such that b(si(b'y + b)) = sx (xs). 
Then by (i), for all b and almost all 5’, the element ty, := t/Xxp’p is general, and 
note that 


IK (to) = JR(t/Xv5) = 6 © Ji (u/(b'y + b)) = $0 JL (wy). 


In particular, «;,,, € 2and ky,,, € 8, and Ky, corresponds to K,,,, under ®, and } 
is normed with respect uj/,. Reasoning as above, for almost b’, b € / one has: 


(b) typ € K* and upry € L, respectively ty,41 € K and up:p41 € L, are general 
elements which correspond to each other under ¢. 
(b)’ Hence ¢ is normed with respect to both uy, and uy: p+41. 


But then by the fact (+) from Remark/Notation 48 applied to the functions 
tata © K™ and ugg € L, it follows that jx (kX, ) = d ° JL (K; ; yc b o yL(L*), 


and therefore we also have jx(k*,) C fo pp (L*) N 7x (K*) = Mx; and the 
same holds correspondingly for the other three pairs of functions which correspond 
to each other under ¢. Thus finally we get 

gel, )C Me, ie, CMe Ji@,,)C Mir Jel, 0.) C Ua, 


ty! atl Upl b+ 


Finally, for a,a’,a” € k, consider the functions 
tatata = (a"t +a'xt+at+1)/(t+a'x+a). 


Then by Fact 43 (1), it follows that for all a”, and almost all a’, a, the function tgvq"q 
is a general element of K too. On the other hand, a direct computation shows that 


axtat l a"tyatit1 
a'x+a tata + 1 


talaa = 


584 F. Pop 


Since the images via jx of both the denominators and the numerators of the fractions 
above lie in Mx, we get Jx(taa'a) € Mx. Reasoning as previously in the case of 
ta’a, We find general elements Ugq7q € L such that Ktattala corresponds to Kujtigtgr CC: 

And we further define correspondingly functions 


b’x+b+1 bY up p41 +1 
b'x +b Upp + 1 


Ub'bib = 


and find functions tyvy, € K, etc. Finally one gets IK Ktarata) C Mx for all 
a” ¢€ k, and almost all a’,a € k. And correspondingly Jz (ku,,,) C Mz for all 
b"” €1, and almost all b’, b € 1. 


Now we conclude the proof of the fact that 7x (kt +k)* C Mx as follows: First, 


"blb 


since JK (kK; ‘ig ) C Mx, we have Jx (tavaa — 1) € Mx. On the other hand, 
tatata ~1 = [(a"” —1)t +1)/(t +a’x +a). 
Now observe that tf + a’x + a = (a’x + a + 1)/to,a7a. Hence 


JK(tta'x+a) € Mx, as Jx(to.ata), Jk(a’x+a+l1) € Mx. Thus we finally deduce 
that IK ((a" —1)t+ 1) € Mx foralla” € k. Hence FA, = px(kt + k)* C Mx, 
as Jx(t) € Mx by hypothesis. In a completely similar way, one concludes that 
P, = Jrxut+l)* Cc My. oO 


Lemma 50. Let Ko = jxz'(Mx) U {0} © K and Lo = 3;,'(Mz) U {0} C L be 
the preimages of Mx, respectively M_, in K, respectively L, together with 0 added. 
Then Ky © K and Lo © L are function subfields. 


Proof. Indeed, since Mx is a subgroup of K, its preimage Jz. (Mx) in K* isa 
subgroup too. We check that Ko is closed with respect to addition: For t,t’ € Ko 
non-zero, t” = t'/t € Ko,andt+t’ = t(t” + 1). On the other hand, by Lemma 49 
we have t” + 1 € Ko. Hence finally we get ¢ + ¢’ = t(t” + 1) € Ko. The proof of 
the assertion concerning Lo is similar, and we omit it. oO 


Next we observe that Mx = jx(Kj) = Kj /k* can be viewed in a canonical 
way as the projectivization A(Ko) := Ko /k~ of the infinite-dimensional k-vector 
space (Ko, +). And correspondingly, M; = j,(Lo) = Li /1* =: A(Lo) is the 
projectivization of the infinite-dimensional I-vector space (Lo, +). And since the 
Kummer homomorphism b: Lok maps M, bijectively onto Mx, the restriction 
of d defines a bijection: 


¢ = $| 19) : P(Lo) = Mi > Mx = A (Ko). 


Notice that the lines in #(Ko) are subsets of the form |, := (kto+kt))*/k* with 
to, t) k-linearly independent functions in Ko. In particular, setting ¢ := t,/to, we see 
that b+, = to: A;, where Y, -= (kt +k)*/k* = jx(kt +k)”. Further note that 
(o.t, depends only on to = jx(to) and t; := jx(t1), and not on the functions fo, t 
themselves. We will therefore also write +, for the line [,,-,, and A, for Y. 
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Correspondingly, the same holds for lines in A (Lo). 


Lemma 51. The morphism ¢ : P(Lo) > A(Ko) respects colineations; more 
precisely, @ maps each line (yy.u, C A(Lo) bijectively onto lty).t, C P(Ko), where 
fo) = (Uo), t = o(ur). 


Proof. Setting t = (u), we get ly.t, = to: f and (y,.u, = Uo - lu. Hence taking into 
account that ¢ respects the multiplication, it follows that it is sufficient to show that 
@ maps YP, bijectively onto A, provided t := (u). 

Recall that d is y-normed, and é(y) = X, where y = jx(y), X = Jx(X), 
for x and y corresponding to each other under d. Moreover, by fact (+) from 
Remark/Notations 48, ¢ maps Py = Py bijectively onto A, = PY. Recall 
that for every 1-index v of %,, and the corresponding 1-index w of Hg, , one has 
commutative diagrams of the form, see Remark 26 (3), and (4) 


w 
a Jw 


J 


Uy —-— Lw ra —e ZiQw 
ee ee ee 
U, > Kk K > Zipp. 


Let x; be the relative algebraic closure of k(t) in Ko. We claim that 
o(Px) C JK(k;). Indeed, let v be such that v(t’) 4 O for some t = ¢(U’) 
with u’ € Yy. Then by the commutativity of the second diagram above we get 
w(u’) # 0. But then it follows that j,, is trivial on /(u)* M U,. Hence by the 
commutativity of the first diagram above, it follows that , is trivial on go J, ( (u)*) ; 
in particular on #(“u). By contradiction, suppose that 6(Y,) Z yx (k:). Then J 
u; € #, and t; € Ko such that ¢ and f; are algebraically independent over k, and 
Jxk(ti) =: t, = @(u;). On the other hand, since f, t; are algebraically independent 
over k, there exist “many” v satisfying the following: v is not trivial on k(t) and 
t is a v-unit, and v is trivial on k(¢,). Note that v being non-trivial on k(t) and t 
being a v-unit implies that the residue of ¢ at v lies in k; hence j,(t) = 0. Now 
let w correspond to v under ®. Then by the commutativity of the above diagrams, 
U = jy(u) is a w-unit, and j,,(u) = 0. Therefore, w is non-trivial on /(u). Further, 
U; = J_(uz) is a w-unit, and 7,,(u,;) 4 0. Hence w satisfies both that w is non-trivial 
on /(u) and that j,, is non-trivial on Uy, M /(u)*. Contradiction! 

Now choose a prime divisor v of K|A such that the following are satisfied: 


(i) vis trivial on ky, and ¢ is a v-unit. 
(ii) x and ¢ have equal residues in Kv; hence j,(x) = J,(f). 


Note that (ii) implies that v is trivial on x; too, hence 7, maps both « and x 
injectively into the residue field Ky. 

Now let w correspond to v under the proper morphism ® : %&, > G,. Then 
reasoning as in the proof of Proposition 35, it follows that the following hold: 


(j) w is trivial on ky. 
Gj) y and u have equal residues in Lw; hence j,(u) = Jw(y). 
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Further, since j,, and 7, respect addition and multiplication, the following hold: 
TWA(A:) = P W(t); IW(Px) _ F yx)s and Iwl Au) = P j(u)s Iw(Yy) = P jy) . 


Hence by (i), (ii), respectively (j), (jj), we get F,,7) = Fyx) and YF) wy = Fy). 
Since 6(Y,) = Y, by the choice of x and y, it follows from dy © Jw = Jvog that 


b(Pryo)) = by (Jw(Py)) = I(G(Py)) = IWPYx) = F(x) . 


Since j, © db = dy © Jw, from the equalities above we finally get 


I($(Au)) = dv (Jw(Pu)) = bv( Fw) = b(P w(y)) P (x) P jt) IWWA1) : 


Hence /, (b(P.)) = Jy(Y,). Since both b(Py) and Y, are subsets of jx (k;‘), and 
Jy is injective on jx (k;‘), we get b( Pu) = F,, as claimed. Oo 


In order to conclude the first part of the proof of Theorem 45 we proceed as follows: 
By Lemma 50 above, @ respects colineations. Therefore, by the fundamental 
theorem of projective geometries, see e.g, Artin [1], @ is the projectivization 
ob = Y(¢') of some linear 19-isomorphism ¢’ : (Lo, +) — (Ko, +), ie., there 
exists a field isomorphism 19 : / — k, and ¢’ is an isomorphism of abelian groups, 
such that $’(au) = 19(a)¢’(u) for all a € J and u € Lo. Moreover, ¢’ is unique up 
to composition by homotheties of the form J, o ¢’ 0 1; (alla € k, b € 1). Further, 
since k* = ker(jx) and /* = ker(J,), it follows that @’(/) = k. We set 


po := (1/9) ¢’, 


and claim that @o is a field isomorphism which maps / isomorphically onto k. In- 
deed, fora fixed y € Lo, consider d, : Lo > Ko defined by $y (u) := ¢o(yu). Then 
oy is a linear 19-isomorphism. Set x = ¢o0(y). Then considering projectivizations, 
and using the fact that @ = Y (go) is multiplicative, it follows that for all u € Lo 
we have 


Pgy)\(W) = Alou) = Alix) 0 Poo), 


where /, is the multiplication by x on Ko. Therefore, there exist a € k* and b € 1* 
such that /, 0 py ol, = 1, odo. In other words, a do(y bu) = x do(u) for allu € Lo. 
Setting u = 1, and taking into account that @o9(1) = 1, we have a 19(b) x = x. Thus 
aig(b) = 1, and hence the effects of /, and /, cancel each other. Hence we have 


do(yu) = x Pou) = poly) dou), (allu, y € Lo), 


hence @¢o is a field morphism. And since ¢o(Lo) = Ko and ¢o(/) = k, it follows 
that do : Lo|/ — Ko|k is an isomorphism of field extension, as claimed. 
Finally, in order to conclude the proof of Theorem 45 we prove the following: 
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Lemma 52. L|Lo is a purely inseparable field extension. 


Proof. First, recall that by the last assertion mentioned at Remark/Notations 48, we 
have M; ® Zi = Liy inside L. Equivalently, for every u € L™ there exists a 
prime to ¢ integer, > 0 such that wu" € Lo. This means in particular that L|Lo is 
an algebraic extension. Since L|/ is a function field over the (algebraically closed) 
field 7, it follows that Lo|/ is so, and L|Lo is actually a finite field extension, of 
degree [L : Lo] = n > 0. From this we deduce that if 7, is minimal such that 
u™ € L, then n,|n. In particular, all the nth powers u”, u € L, are contained in Lo. 
But then n = [L : Lo] must be a power of the characteristic, as claimed. Oo 


For the uniqueness of 7 up to Frobenius twists, one uses the last lemma above, 
and applies Proposition 38. This completes the proof of Theorem 45. oO 


7 Appendix 


Here we recall a few facts concerning the pro-¢ abelian quotient of the fundamental 
group, and facts concerning the structure of the divisor class group as an abstract 
group. All these seem to be folklore and might be well be known to the experts, but 
I cannot give a quick reference. Throughout this section, K |k is a function field over 
an algebraically closed field k of characteristic p > 0. 


7.1 The Kummer interpretation of ,,¢((X ) 


Let K|k be a function field, and X — k anormal model of K|k. Recall that Dy 
is the set of prime divisors of K|k which are defined by the Weil prime divisors 
of X, and divy : K* — Div(X) the corresponding divisor map. Then denoting by 
Ux := I'(X, Gx)” the invertible global functions on X, one has ker(divy) = Wy, 
and Wy /k* is a finitely generated free abelian group. Finally, one has an exact 
sequence of the form 


1s YW > K*/k* = FA (K) > 0, 


where 7 (K) := divy(K) is the group of principal (Weil) divisors of X, and the 
exact sequence defining the divisor class group is 


11> A (K) > Div(X) > CU(X) > 0. 
We now want to recall the Kummer interpretation of the prime to the characteristic 


torsion of €[(X), which is as follows: Let char(k) = p > 0 be the characteristic of 
k, and 7 a positive integer not divisible by p. Then tensoring the last exact sequence 
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with Z/n, we get the exact sequence 
0 > ,€UX) O K&(K)/n > Div(X)/n > CU(X)/n > 0, 


where ,€I(X) is the n-torsion of €{(X). In particular, if we denote by A, the 
preimage of ,€1(X) C #\(F)/n in H\(K), it follows that we have a canonical 
exact sequence 


0> A, @ K*/n > Div(X)/n > EU(X)/n > 0, 


because K*/n = (K*/k*)/n by the fact that k being algebraically closed. Notice 
also that A,, fits into an exact sequence of the form 1 > Yy/n > A, > ,€l 
(X) — 0; thus it is a quotient of (Z/n)! for some index set I. 


Fact 53. In the above context, set Ky, := K|/A,]. Then K,|K is an n-elementary 
abelian extension of K satisfying: 


(1) Gal(K,,|K) = Hom(A,, f4,) canonically. 

(2) K,|K is the maximal n-elementary abelian extension of K in which all v € Dx 
are unramified. 

(3) In particular, for € # char(k), let Keoo|K be the maximal pro-€ abelian 
extension in which all v € Dx are unramified. Then 


Gal(Kyso|K) = TT,py = Hom(Aco, pee), 


where Ag = lim A, fits canonically in0 > Wy ® Qe/Ze > Aw > 


po €l(X) > 0. 
(4) Finally, ,€UX) and K,|K are finite by Fact 54 below. 


Proof. The first assertion is clear by Kummer theory. For the second assertion, 
consider the following commutative diagram: 


11> A, -— K*/n => Div(X)/n > €1(X)/n > 0 


1 1 | | 
0> ,€UX) ~ %&%&(K) > Div(X)/n > E(X)/n > 0 


We first show that all v € Dy are unramified in K,,|K. Equivalently, we have to 
prove that for all f © K™%* whose images lie in A,, all v € Dy are unramified 
in Kt )| K. Now if the image of f € K™* lies in A,, then the image of f in 
Hx (K) lies actually in ,€((X). Therefore, the divisor (f) of f has trivial image 
in Div(X)/n; in other words, there exists some divisor P € Div(X) such that 
nP =(f), ie, vf) €n-vK for all v € Dy. But then by Hilbert decomposition 
theory, it follows that every v € Dy is unramified in K /F )|K. Since f was 
arbitrary, it follows that all v € Dy are unramified in K,,|K. 

Conversely, for f € K*, suppose that all v € Dy are unramified in K Gf )[K. 


Equivalently, v(f) € n-vK for all v € Dy; hence divy(f) € n- Div(X). Thus 
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there exists some divisor P € Div(X) such that divy(f) = nP in Div(X). But 
then divy(f) € #(K) has a trivial image in Div(X)/n, and therefore, the image 
of f in K*/n lies actually in A, as claimed. 

Finally, the proof of assertion (3) follows from assertion (2) by “taking limits.” O 


7.2 On the Weil divisor class group ¢1(X ) 


We want to say a few words about the divisor class group CI(X) := Div(X)/H%% (K) 
(as an abstract group) of a normal model X¥ — k of K|k in more detail. 


Fact 54. Let k be an algebraically closed field, and X — k be an integral normal 
variety. Then the divisor class group €\(X ) can be written as a (direct) sum: 


(*) CUX) = A(X) + Ai(X), 


where A(X) is the maximal divisible subgroup of CX), and A\(X) is a finitely 
generated abelian group. Further: 


(1) The maximal torsion divisible subgroup A;(X) © Ao(X) of CU(X) is a quotient 
of (Q/Z)’ for some r > 0. 
(2) Ifk is an algebraic closure of a finite field, then Ay(X) = A,(X). 


Proof. First, since Ag(X) C €I(X) is the maximal divisible group, it follows that it 
has complements A(X); hence €l(X) = Aop(X) + A(X) as a direct sum. Let K be 
the function field of X¥. Let Y — k and X — k be normal models of K|k such that 
Dy C Dy. We first claim the following: 


Claim 1. €1(X) satisfies («) iff C((Y) satisfies (*). 


Indeed, let Ag(X) C €l(X) and Ag(Y) C Cl(Y) be the (unique) maximal divisible 
subgroups. Setting S := Dy\Dy, we have that S is finite, Dy = Dy U S, 
and the canonical projection map pr : Div(Y) — Div(X) has as kernel As := 
Yves Zv. Thus the canonical projection pr : divy(K*) — divy(K”) has kernel 
Us := divy(K”*) N As, which a finitely generated group, and the canonical map 
pr: €(Y) > C(x) has kernel As /Y%s. Further note that pr( Ao(Y)) C A(X), 
because the former group is divisible, thus contained in the unique maximal divisible 
subgroup Ao(X) of €l(X), and pr : Ao(Y) — Ao(X) has kernel Aso := 
(As/Ws) N Ao(Y), which is a finitely generated group, as As/Ws is so. Hence 
setting Ay := (As/Ws)/Aso, we finally get an exact sequence of the form: 


(7) 0— As > A(Y) > A(X) > 0. 


Since As is a finitely generated group, it follows that A(Y) if finitely generated iff 
A(X) is so. This concludes the proof of Claim 1. 

We next notice that the assertion of the above Claim | holds in the same form for 
the Cartier divisor class group Ca€I(X), and the proof of this fact is word-by-word 
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the same as in the case of €((X). Further, if X is smooth (enough factorial), then 
CaCl(X) = Cl(X). 

We conclude the proof of Fact 54 as follows: Let X¥ — k be an arbitrary normal 
integral variety. Further, let X > kbea projective normal model of the function 
field K := x(X) of X > k. Then X and ag are birational, and hence they have 
isomorphic open subsets U C X and U Cc X. Moreover, we can suppose that 
U = U are actually smooth over k. By Gan 1 and its Cartier form, it follows 
that, first, €{(X) has the structure (x) iff €((U) does, and second, Ca€I(X) has the 
structure (+) iff €a€l(U) does. On the other hand, since U X U is smooth over k, 
hence factorial, we have Cl(U) = Ca€l(U). Thus finally, it is sufficient to show that 
€a€l(X) has the structure (*). In order to conclude, recall that for X > k projective 
integral normal, the structure of CaCl(X ) is known: Indeed, by Kleiman [14], 
Theorem 4.8, Theorem 5.4, Corollary 6.17, Remark 6.19, it follows that Cacl(x j= 
Pic(X )(k) has the structure (+). This concludes the proof of Fact 54. Oo 


Fact 55. In the above context and notation, let X —> k and Y — k be 
normal models of K|k such that Dy C Dy. In the notations from the proof 
of Claim 1, let Ajy(X) C Ao(X) be the torsion subgroup, whence Ajoy(X) is 
divisible too; and let A,(X) © €l(X) be the preimage of the torsion group of 
CU X)/Ao(X), hence A,/Ao(X) is finite, and CU(X)/A_,(X) is finitely generated 
ay abelian. The projection pr : E\(Y) > €U(X) gives rise to homomorphisms 

:A,(Y) > A,(X), pty : Ao(Y) > Ao(X), Ptyoe : Ator(Y) > Ator(X), and the 
pion hold: 


tor 


(1) pr,,, “as finite kernel and cokernel isomorphic to (Q/Z)', where r the rational 
rank of (As /Ws)N Ao(Y ). Therefore, Ator(X) = Ator(Y) B (Q/Z)" as abstract 
groups. 

(2) The rational ranks of A\(Y), A(X) satisfy (A (Y)) a rr(A;(X)) + |S|—r. 
Hence one has rr(A;(Y)) = rr(A1(X)) +|S| iff Ys is trivial iff YW = Uy. 

(3) If the equivalent conditions from (2) above are satisfied, then pt,,. and pty are 
isomorphisms, and pr, maps A,(Y)/Ao(Y) injectively into A,(X)/Ao(X). 

(4) Suppose that X has/satisfies the following equivalent properties: 


(i) A,(X)/Ao(X) and Ajo,(X) are minimal. 
(ii) A\(Y) & A(X) @ Z!®¥\Px! for all normal models Y — k of K|k with 
Dy CY. 
(iii) |A;(X)/Ao(X)| is minimal and rr(A\(Y)) = rr(A,(X)) + |Dy\Dy| for 
all Y as above. 


Then At, G Ao(X) © A(X) are birational invariants of the function field 
K(X). 


Proof. Everything follows immediately from the proof of Claim 1. For asser- 
tions (2), (3) and (4), use the exact sequence (7). In particular, note that if the 
equivalent conditions from assertions (2) are satisfied, then Ws trivial, and therefore 
Aso = AN Ao(X) is a torsion subgroup of the free abelian group As. Hence 
Aso is trivial, and As = As is a finite free Z-module. Thus the exact sequence (+) 
becomes 0 > As — A(Y) > A(X) — 0, etc. Oo 
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7.3 On the (pro-£ abelian) “‘birational’’ fundamental 
group [ix 


Let K|k be a function field. It is well known that if K|k has regular complete 
models X —> k, then the “usual” fundamental group 7, (X ) whose open subgroups 
parametrize all the étale connected covers of X is a birational invariant of K|k, 
in the sense that z;(X) does not depend on the particular regular complete model 
X — k. In general, one can consider the following replacement for the fundamental 
group of complete regular models: Let Dx), be the set of prime divisors of K|k. 
For every v € Dxjx, let ¥ be prolongations of v to an algebraic closure K of K, and 
Tz C Gx the inertia group of ¥ in the separable closure K*|K of K in K. Then the 
prolongations v are conjugated under Gx; thus the set of all the inertia groups 7; is 
closed under Gx-conjugation. Therefore, the closed subgroup Tx C Gx generated 
by all the 7; for all v € Dx |x and all their prolongations Vv to K is anormal subgroup 
of Gx. Moreover, setting 
1K 1= G/T, 


we see by the functoriality of Hilbert decomposition theory that the fixed field K of 
Tx in K* is the maximal field extension of K in which all the prime divisors v of 
Kk are not ramified. We will call 21x the birational fundamental group for K|k. 

More generally, let D C Dx\x be any set of prime divisors of K, e.g., D = Dx 
is the set of prime divisors defined by the Weil prime divisors of a normal model 
X — k of K|k. Then we denote by Tp C Gx the subgroup generated by all the T; 
with v € D and ¥ all the prolongations of v to K. As above, Tp is normal in Gr, 
and the quotient 

1p = Gx/Tp 


will be called the fundamental group for D. We notice that open subgroup of 
Jt\,p parametrize all the finite extensions L|K of K in which all v € D are not 
ramified. 

In the same way, we introduce/define the pro- abelianizations of the fundamen- 
tal groups introduced above, 


IT.x = Tg / Tog, and IT\.p = ITx/Tp, 


and call them the pro-£ abelian birational fundamental group for K|k, respec- 
tively for D. 


Remarks 56. Let X — k be a normal model of K|k, and 7(X) > IT, (X) := 
re ) the canonical projection. In the above context and notation, the following 
hold: 


(a) Forevery D C Dx there are canonical surjective projections 71,p — 71,x, and 
I1.Dy — 11(X), which by the functoriality of Hilbert decomposition theory 
give rise to surjective projections IT, p > IT,_x and IT; py > IN\(X). 
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(b) Nevertheless, if X is regular, then 2),.p, = 7)(X), thus also IT, py, = IN(X), 
by the purity of the branch locus. 

(c) And if X is complete and regular, then mj.x = 2(X), thus also 
IT, x = IT,(X), by the purity of the branch locus. 


Fact 57. In the above context, the following hold: 


(1) Let X — k be a complete normal model of K\k, and U C X is a regular 
open subvariety. Then there are canonical surjective projections 1(U) > 
‘(se ad m\(X). 

(2) There exists a geometric set Dx such that II\,ny — IT,x is an isomorphism. 
Hence IT, p,, — I,K is an isomorphism, provided Dy © Dy. 

(3) Let X be an affine normal curve with IT py = [\,x. Then either X = Ap or 
X is isomorphic to E\{pt} with E a complete curve of genus one. 

(4) If X is a normal model of K|k such that II,,p, = II\,x, then the group of 
global invertible sections on X is Uy = k*. 


Proof. To (1): The existence and surjectivity of 7;(U) — 2.x follow from (a) 
and (b) above. For the existence and surjectivity of 2|.x — m,(X), we notice first 
that since X — k is acomplete variety, for every v € Di, there exists a dominant 
canonical k-morphism Spec @, — X. On the other hand, since a base change of 
an étale cover is étale, we have that if Y — X is some finite connected étale cover 
defined by some finite quotient of 2;(X), then Y is integral, and Y xy Spec @, is 
étale over O,. Equivalently, v is unramified in the field extension k(X) — k(Y); 
hence the image of the inertia group 7, in 2; (X) is trivial, etc. 

For assertion (2), first consider a small enough affine open subset Xp C X’ 
such that Xo is regular. Then Xo — k is a quasi-projective regular model for K |k; 
hence zy Dy = mt1(Xo0), by the purity of the branch locus. Therefore, 7 Dxo is 
finitely generated, hence a finite module as Z,, as it is an abelian pro-¢ group. Since 
IT py, — IT\,x is surjective, IT,,x is a finite Zp-module too. But then 


A = ker(IN,, py, > Mx) 


is also a finite Zp-module. Finally, by the definition of I7),x, it follows that for 
every g € A there exists some v € Dy such that g € T,. Since A is finitely 
generated, there exists a finite set © C Dj such that the images of T, (all v € Y) 
in IT\,py, generate A. In order to conclude, consider any quasi-projective normal 
model X — k such that Dy, Y C Dy (hence in particular, Dy, C Dy too). 

Assertion (3) follows immediately from the structure theorem for (the abelian 
pro-£ quotient of the) fundamental groups of a normal curve. 

Finally, assertion (4) follows from the following: By contradiction, let f € Ux 
be a non-constant global invertible section. For every n = ¢° with e > 0, consider 
the normalization X, — X of X in the finite subextension K, := K RF ] of 
K ~ K’. Then since f is a v-unit for all v € Dy, it follows that v is unramified in 
X, — X, and therefore, Gal(K,,| K) is a quotient of [7;,p,. On the other hand, if w 
is a prime divisor of K|k with w(f) > 0, then w is ramified in K,|K for n > 0; 
hence Gal(K,,| K) is not a quotient of [7,,x. Contradiction! Oo 


Recovering function fields from their decomposition graphs 593 


References 


[1] 
[2] 


[3] 


[4] 
[5] 


[6] 
[7] 


[8] 
[9] 


[10] 
{11] 
{12] 
[13] 
14] 
15] 


[16] 
17] 


18] 
19] 
20] 
[21] 


22] 


[23] 


[24] 


[25] 
[26] 


[27] 


Artin, E., Geometric Algebra, Interscience Publishers Inc., New York, 1957 

Bogomolov, F. A., On two conjectures in birational algebraic geometry, in: Algebraic 
Geometry and Analytic Geometry, ICM-90 Satellite Conference Proceedings, ed. A. Fujiki 
et all, Springer Verlag Tokyo 1991. 

Bogomolov, F. A. and Tschinkel, Y., Commuting elements in Galois groups of function 
fields, in: Motives, Polylogarithms and Hodge theory, eds F.A. Bogomolov, L. Katzarkov, 
International Press, 2002. 

Bogomolov, F. A. and Tschinkel, Y., Reconstruction of function fields, See arXiv:math/ 
0303075v2[math. AG] 16O0ct2003, 

Bogomolov, F. A. and Tschinkel, Y., Reconstruction of function fields, Geometric And 
Functional Analysis, Vol 18 (2008), 400-462. 

Bourbaki, N., Algébre commutative, Hermann Paris, 1964. 

Deligne, P., Le groupe fondamental de la droite projective moins trois points, in: Galois 
groups over Q, Math. Sci. Res. Inst. Publ. 16, 79-297, Springer 1989. 

Deligne, P., Letter to Pop, September 1995. 

Faltings, G., Curves and their fundamental groups (following Grothendieck, Tamagawa and 
Mochizuki), Astérisque 252 (1998), Exposé 840. 

Geometric Galois Actions I, LMS LNS Vol 242, eds. L. Schneps — P. Lochak, Cambridge 
Univ. Press 1998. 

Grothendieck, A., Letter to Faltings, June 1983. See [GGA]. 

Grothendieck, A., Esquisse d’un programme, 1984. See [GGA]. 

Kim, M., The motivic fundamental group of P'!\{0, 1, 00} and the theorem of Siegel, Invent. 
Mathematicae 161 (2005), 629-656. 

Kleiman, S. L., The Picard Scheme. Fundamental algebraic geometry, Math. Surveys 
Monogr. 123, 235-321. 

Koenigsmann, J., On the “Section Conjecture” in anabelian geometry, J. reine angew. Math. 
588 (2005), 221-235. 

Kuhlmann, F.-V., Book on Valuation Theory.See: http://math.usask.ca/~fvk/Fvkbook.htm. 
Lang, S., Algebra, Revised third edition. Graduate Texts in Mathematics 211. Springer- 
Verlag, New York, 2002. 

Lang, S., Introduction to Algebraic geometry, Third printing, with corrections. Addison- 
Wesley Publishing Co., Inc., Reading, Mass., 1972. 

Mochizuki, Sh., Absolute Anabelian Cuspidalizations of Proper Hyperbolic Curves, See 
http://www. kurims.kyoto-u.ac.jp/~motizuki/papers-english.html. 

Mumford, D., The Red Book of Varieties and Scheme, LNM 1358, 2nd expanded edition, 
Springer-Verlag 1999. 

Neukirch, J., Uber eine algebraische Kennzeichnung der Henselkorper, J. reine angew. Math. 
231 (1968), 75-81. 

Neukirch, J., Kennzeichnung der p-adischen und endlichen algebraischen Zahlkérper, 
Invent. math. 6 (1969), 269-314. 

Parshin, A. N., Finiteness Theorems and Hyperbolic Manifolds, in: The Grothendieck 
Festschrift II, eds. P. Cartier et all, PM Series Vol 88, Birkhauser Boston, 1990. 

Pop, F., On Alterations and birational anabelian geometry, in: Resolution of Singularities, 
Birkhauser PM Series, Vol. 181, p. 519-532; eds. Herwig Hauser et al., Birkhauser Verlag, 
Basel 2000. 

Pop, F., Recovering K|k from Gx (£), MSRI Talk notes, Fall 1999. See http://www.msri.org/ 
publications/In/msri/1999/gactions/pop/1/index.html 

Pop, F., The birational anabelian conjecture —revisited—, Manuscript, Princeton/Bonn 
2002. See http://www.math.leidenuniv.nl/gtem/view.php 

Pop, F., Pro-€ birational anabelian geometry over alg. closed fields I, See arXiv:math/ 
0307076v1[math.AG]5July2003. 


594 


[28] 


29] 
[30] 


31] 


[32] 


33] 


34] 


[35] 
[36] 


[37] 


38] 


F. Pop 


Pop, F., Pro-£ abelian-by-central Galois theory of Zariski prime divisors, Israel J. Math. 180 
(2010), 43-68. 

Pop, F., Inertia elements versus Frobenius elements, Math. Annalen 438 (2010), 1005-1017. 
Pop, F., On the birational anabelian program initiated by Bogomolov I, Manuscript 2009 
(submitted). 

Roquette, P., Zur Theorie der Konstantenreduktion algebraischer Mannigfaltigkeiten, J. reine 
angew. Math. 200 (1958), 1-44. 

Roquette, P., Nonstandard aspects of Hilbert’s irreducibility theorem, in: Model theory and 
algebra (A memorial tribute to Abraham Robinson), LNM Vol. 498, Springer, Berlin, 1975. 
231-275. 

Saidi, M. and Tamagawa, T., Prime to p version of Grothendieck’s anabelian conjecture for 
Hyperbolic Curves over Finite Fields of Characteristic p > 0, Publ. RIMS Kyoto University 
45, no. 1 (2009), 135-186. 

Szamuely, T., Groupes de Galois de corps de type fini (d’aprés Pop), Astérisque 294 (2004), 
403-431. 

Stix, J., Projective anabelian curves in positive characteristic and descent theory for log- 
étale covers, thesis, Univ. of Bonn, 2002. 

Uchida, K., Isomorphisms of Galois groups of solvably closed Galois extensions, Téhoku 
Math. J. 31 (1979), 359-362. 

Uchida, K., Homomorphisms of Galois groups of solvably closed Galois extensions, J. Math. 
Soc. Japan 33 (1981). 

Zariski, O. and Samuel, P., Commutative Algebra, Vol. II, Springer-Verlag, New York, 1975. 


Irreducible spaces of modular units 


David E. Rohrlich 


In memory of Serge Lang 


Abstract We give a representation-theoretic decomposition of the group of mod- 
ular units of prime level. Apart from the formulation, the results obtained are 
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Introduction 


The group SL(2,Z) of 2 x 2 matrices with integer coefficients and determinant 
1 acts on the complex upper half-plane H by fractional linear transformations, 
and if [ Cc SL(2,Z) is a subgroup of finite index then the orbit space of H 
under I will be denoted Y(I°) and endowed with the unique complex structure for 
which the quotient map z : H — Y(T) is holomorphic. As a Riemann surface, 
Y(T) is noncompact but of finite type: Y(T) = X(T) ~ C, where X(L) is a 
compact Riemann surface and C C X(T) the finite nonempty subset of cusps of 
I. A modular function for T is the pullback under z of the restriction to Y(T) 
of a meromorphic function on X(T’). We denote the field of modular functions for 
T by m’. 
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Now fix I and let O Cc Mt be the subring of holomorphic functions. Its unit 
group O~* will be denoted U, and an element of U will be called a modular unit. 
Thus a modular unit is a modular function which is holomorphic and nowhere 
vanishing on H. If we identify 9’ with the function field of X(T) then U 
corresponds to the multiplicative group of meromorphic functions on X(T) with 
divisorial support in C. Writing C* for the subgroup of constant functions in U, we 
deduce that the free abelian group U/C* has rank at most |C| — 1. 

A theorem of Manin [8] and Drinfeld [2] asserts that the rank is exactly |C|—1 if 
T is a congruence subgroup of SL(2, Z), in other words a subgroup containing the 
kernel of the reduction-mod-N map SL(2,Z) — SL(2,Z/NZ) for some integer 
N = 1. The kernel itself is called the principal congruence subgroup of level N and 
denoted (4), and the proof of the Manin—Drinfeld theorem immediately reduces 
to the case [ = T'(N). It is at this juncture that the work of Kubert and Lang [6] 
enters the picture, providing a refinement of the Manin—Drinfeld theorem which is 
crucial if one is concerned, as we shall be here, with the action of SL(2, Z/ NZ) on 
U arising from the identification SL(2, Z/NZ) = SL(2,Z)/T(N). 

The refinement introduced by Kubert and Lang is a parametrization of the group 
U/C%, or at least of some large subgroup of it, using the “Siegel functions” 


Sal2) = —qre™ 2D (1 — g.) | [GQ — 9242) — 92/42) (1) 


n21 


({6], p. 29, formula K4), where the parameter a = (a),d2) is an ordered pair of 
rational numbers a; and dz, not both integers, while g, = e-" with z € H and 
qr = e' with t = ayz+ ao. Also b = (at — a, + 1/6)/2 and q? = e?7, 
If a;,a2 € N~'Z then gy is in general a modular function only for [(12N7), but 
Kubert and Lang give a criterion for a product of such Siegel functions to be a 
modular function for [(NV), hence by virtue of the product expansion (1) a modular 
unit for '(V). We state the criterion only when N is relatively prime to 6; for the 
general case see [6], pp. 76—78. Suppose that 


e=[Le*. (2) 


where a runs over N~!Z? ~ Z? and m(a) € Z with m(a) = 0 for all but 
finitely many a. (If one prefers, the product can be taken to run over a set of coset 
representatives for Z? in N~!Z? ~ Z?: up to a constant factor, g, depends only on 
a + Z?.) The Kubert—Lang criterion states that g is modular of level N if and only 
if 


S = m(a) = 0 (mod 12) (3) 
and 


SY m(a)(Na\) =) m(a)(Nazy’ = Y m(a)(Na\)(Naz) = 0 (mod N). 
a a a (4) 
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Kubert and Lang refer to (4) as the “quadratic relations,” but they are actually linear 
relations in the exponents m(a). It is only the coefficients of these linear relations 
that are quadratic in the parameter. Be that as it may, let us use the term Siegel group 
for the subgroup of U/C% consisting of the cosets of all products (2) satisfying 
(3) and (4). Kubert and Lang prove that the Siegel group has the maximal rank 
|C| — 1, thereby recovering the Manin—Drinfeld theorem, and in the optimal case 
where N = p” witha prime p = 5 and an integer n = | they show that the Siegel 
group is the full group of modular units modulo constants, so that (2), (3), and (4) 
give a complete description of U/C% ([6], p. 83, Theorem 1.3). 

The present note gives an application of their result. Henceforth p is a fixed prime 
27 and U is the group of modular units for '(p). Also G = PSL(2, F,). We shall 
determine the structure of the natural representation of G on the vector space U/ U? 
over FF ,. Most of what we prove is already contained in the results of Gross [3], but 
this may not be obvious, because our point of view is quite different. In [3] the 
goal is to understand the Galois module structure of the subgroup of the Jacobian of 
X(p) generated by the cusps. Furthermore, the Galois group at issue in [3] is the full 
arithmetic Galois group GL(2, F,,)/{+1} of the cover X(p) > P', where X(p) is 
viewed as a curve over Q(e?7'/”) and P! as the j-line over Q. In the present note, by 
contrast, we consider only the geometric subgroup G = PSL(2, F,) of this Galois 
group, because the applications we have in mind involve regular Galois extensions, 
as in [7] and [9]. This point of view dictates the formulation of our final result, 
which can be summarized as follows: Every irreducible nontrivial representation of 
G over F, occurs with multiplicity one in the maximal semisimple subspace of the 
“noncongruence part” of U/U? (to be defined). 

The proof of this assertion includes a purely representation-theoretic component, 
which is divided into three steps—Proposition 2, Propositions 3 and 4, and 
Proposition 5—and may appear inefficient, particularly in comparison to the proof 
of the corresponding facts for GL(2, F,)/{+1} in [3] (p. 73, Proposition 5.1). On 
the other hand, the three steps do illustrate three different tools—Brauer—Nesbitt 
theory, Frobenius reciprocity, and orthogonal idempotents respectively—while 
Gross’s argument rests on yet a fourth tool, namely highest weight vectors. The 
more important point, however, is that here as in [3], the really crucial input is 
not abstract representation theory but rather the explicit parametrization of modular 
units provided by Kubert and Lang. To appreciate this point it is helpful to recall that 
the natural representation of G on the space of modular forms of weight 2 and level 
Pp was decomposed into irreducibles in two papers of Hecke [4], [5]. As one would 
expect, most of the work in these papers goes into decomposing the space of cusp 
forms, but it is actually the space of Eisenstein series—dealt with by Hecke in a few 
lines—which has some bearing on the present note. The reason is simple: if f ¢ U 
then (d log f)/dz is an Eisenstein series of weight 2 and level p. In fact the space 
of all such Eisenstein series is simply C @z (d log U)/dz. Furthermore, since the 
kernel of f t (d log f)/dz is the subgroup of constant functions C* Cc U?, we 
see that U/U? is isomorphic as an F ,[G]-module to F, ®z (d log U)/dz. Thus the 
representation of G on U/U?” arises via tensor product with F,, from a G-stable 
Z-form of the space of Eisenstein series. It follows that the semisimplification 
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of U/U? can be computed directly from Hecke’s decomposition of the space of 
Eisenstein series into irreducibles. But computing the structure of U/U” itself is 
another matter entirely, and in attempting to do so we will find that the Kubert— 
Lang parametrization of modular units is an indispensable tool. 

Whatever else it may accomplish, the real value of this note to its author is the 
opportunity it provides to acknowledge an enormous personal debt to Serge Lang, 
to whom I owe my career in mathematics. I also take this opportunity to thank the 
referee of [9], whose suggestion for simplifying the proof of Proposition 7 of [9] 
turned out to be an essential ingredient of the present work. 


1 The module of parameters 


The Z[G]-module M introduced below is a first approximation to the domain of 
the Kubert—Lang map parametrizing U. Our goal is to decompose the associated 
representation of G on the vector space V = M/pM over Fp. 


1.1 Preliminaries 


The irreducible representations of G in characteristic p can be classified using 
a single invariant: their dimension. Indeed for each integer k satisfying 0 < 
k < (p— 1)/2 there is an absolutely irreducible representation 0, of G over F, 
of dimension 2k + 1, and o; is unique up to isomorphism. Furthermore, every 
irreducible representations of G in characteristic p is isomorphic to some ox. In 
order to work with an explicit model we shall take ox to be the (2k)th symmetric 
power of the tautological two-dimensional projective representation of G. Then the 
space of o; consists of binary homogeneous polynomials f(x, y) of degree 2k over 
F’,, and the action of G is given by the formula 


(ox(g) f(x,y) = flax +cy,bx + dy), (5) 


where g is the image in G of the element 


~ [ab 
E> (‘ ‘) (6) 
of SL(2, F,). 


Put R = F7, ~ {(0,0)}. We define M to be the free Z-module of rank (p? — 1)/2 
consisting of functions m : R — Z such that m(—r) = m(r) forr € R. An action 
of G on M is given by the formula 


(g -m)(r) = m(rg), (7) 
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where g is either of the two lifts of g to SL(2,F,) and rg is the product of the 
1 x 2 row vector r and the matrix g. Of course this action is formally the same 
as (5), except that m is now an even function R — Z rather than a homogeneous 
polynomial over Fy. 

Given a field F, put Ve = F @z M and extend the action (7) by linearity 
to a representation tr of G on Vr. We can identify Vr with the vector space of 
dimension (p*—1)/2 over F consisting of even functions m : R — F, and then the 
action of G is again formally the same as in (5) and (7). We are primarily interested 
in the case F = F,, and in this case we write Vp and tr simply as V and t. 


1.2 Irreducible constituents 


Write B for the image in G of the upper triangular subgroup of SL(2,F,) and 
N C B for the image of the strictly upper triangular subgroup (i.e., the subgroup 
defined by the conditions c = 0,a = d = 1 in (6)). We denote the trivial one- 
dimensional character of any group by 1, leaving both the group and the implicit 
field of scalars to be inferred from the context. In the following proposition, for 
example, | is the trivial one-dimensional character of N with values in F’, and ind@ 1 
is the representation of G over F which it induces. 


Proposition 1. tz ~ ind 1. 


Proof. Take the space of ind¢ 1 to consist of functions f : G > F satisfying 
fing) = f(g) forn € N and g € G, with G acting by right translation. As 
we have already noted, Vp is also a space of functions, namely the space of even 
functions m : R — F. Furthermore, given f in the space of ind& 1, we obtain an 
element my € Vr by setting m¢(r) = f(g) if eg = +r, where e is the row vector 
(0,1) € R. The map f +> m+ is readily verified to be G-equivariant and injective, 
and its domain and range both have dimension (p? — 1)/2. Oo 


We now take F = F,, and compute the semisimplification of t: 


Proposition 2. The multiplicity of 0, as a constituent of t is 1 ifk = Oork = 
(p — 1)/2 and 2 if1 <k < (p —3)/2. 


Proof. Givent € Fe let a(t) denote the image in B of the diagonal matrix with 
diagonal entries ¢, t~!. The map t +> a(t) induces an isomorphism of quotient 
groups F*/{+1} = B/N, and we can compose the inverse of this isomorphism 
with even powers of the Teichmiiller character w : F* > Zi to obtain characters of 
B. More precisely, we define & : B > Q) (0 <k < (p — 3)/2) by setting 


&(a(t)n) = w(t)* (@ eF*S, neN). 


Then ind? 1 = ae ui "hs whence Proposition | and the identification ind® 1 = 
ind¢ (ind? 1) give 
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19, = O09 Pm (8) 


with a, = ind$& (cf. formula (22) of [4]). We remark that 79 ~ 1 @ ny with an 
absolutely irreducible representation 7 of dimension p over Q,, while if p = 1 
mod 4 then 7p—-1)/4 decomposes over Q, as the direct sum of two inequivalent 
irreducible representations ¢ and ¢’ of dimension (p + 1)/2. Apart from these 
exceptions, the direct summands in (8) are absolutely irreducible (although not 
distinct, as 7%  m(p—1-2«)/2 for 1 < k < (p — 3)/2). 

Put M = Z, ®z M. Then M is a G-stable Z,-lattice in Vg, and V = 
F, ® M. Hence the semisimplification of V can be read from (8) and the mod-p 
decomposition numbers of G. These decomposition numbers are implicit in Brauer— 
Nesbitt [1] (p. 590) and explicitly computed by Srinivasan [10] (pp. 107-108). In 
applying [10], note that form = 1 her ®(ro) and ¢(7o) coincide. Hence taking 
ro = 2k in formula (3.5) of [10], we find that the character of our 2; coincides 
on p-regular conjugacy classes with the sum of the Brauer characters of our ox, and 
O(p—1—2k)/2- In the first instance this conclusion holds only when 1 < k < (p—3)/2 
and k 4 (p — 1)/4, but in fact it holds also when k = 0 (by the first three lines on 
p. 108 of [10]) and when k = (p — 1)/4 (by formula (3.7) of [10]). The upshot is 
that in all cases, the semisimplification of the reduction modulo p of sr, coincides 
with 0, ® 0(p—1—-2k)/2- Hence the proposition follows from (8). oO 


1.3 Irreducible subspaces and quotient spaces 


Next we determine the multiplicity of 0; as a quotient representation of t. Given 
representations a and 6 of a group J on vector spaces Wy and Wg over a field F, 
write Hom y|(a, 8) for Hompyy| (Wa, Wa). 


Proposition 3. For0 < k < (p—1)/2, 
dimer, Home, (cj (t, on) = 1. 
Proof. Proposition | and Frobenius reciprocity give 
Home, [c](t, 0x) = Home, [vj (1, res ox). 


Now N is generated by the element u corresponding to the choicesa = b=d = 1 
and c = 0 in (6), so it suffices to see that the subspace of vectors fixed by 0; (u) 
is one-dimensional. Let A be the matrix of o;(u) relative to the ordered basis 
Dae cama ee yk, and let a;; be the (i, j)-entry of A for 1 < i,j < 2k +1. 
Using (5) to write (ox, (u) f)(x, y) = f(x,x + y), one readily verifies that A is 
upper triangular, that a;; = 1 for all i, and that a;;4,; #4 0 for 1 <i < 2k. It 
follows that the Jordan normal form of A consists of a single Jordan block, whence 
x? is the unique eigenvector of o;(u) up to scalar multiples. oO 
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A similar statement holds for subrepresentations: 


Proposition 4. For0 < k < (p—1)/2, 
dimr,, Hom, {a} (o,,T) = 1. 


Proof. In view of Proposition 3 it suffices to see that both o; and Tt are self-dual. 
The self-duality of o; follows from the fact that irreducible representations of G 
over IF, are determined up to isomorphism by their dimension. The self-duality of 
t follows from the fact that the symmetric bilinear form 


(m,m') = S> m(r)m'(r) (m,m' €V) (9) 
reR 
is nondegenerate and G-invariant. oO 


1.4 Homogeneous components 


Recall that M = Z, ®z M and that o : FP > Zz; is the Teichmiiller character. We 


shall view the elements of M/ as even functions m : R — Z,. We define M*®) cM 
to be the Zp-submodule consisting of all m € M such that 


m(tr) = w(t)*m(r) 


fort € Fe andr = (r1,72) € R, where tr = (tr, tr2). The linear endomorphisms 
e of M given by 


(em)(r) = al S> ow *(t)m(tr) (10) 


x 
teF* 


(0 <k < (p —3)/2) form a family of orthogonal idempotents projecting M onto 
the respective submodules M“) and summing to the identity, so we have 


M= ee? M®, (11) 


In fact (11) is a decomposition into Z,[G]-submodules, because the idempotents 
e“ commute with the action of G. Hence the space of t likewise decomposes into 
G-stable subspaces: 

v=oe) ve (12) 


with V = F,@z, M“. Let r“) denote the representation of G on V“. We have 
assumed that p = 7 rather than merely p = 5 to ensure the validity of the following 
statement (and several others hereafter): 
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Proposition 5. If 1 < k < (p — 3)/2 then t™ has a unique irreducible 
subrepresentation and a unique irreducible quotient representation, and they are 
equivalent to og and O(p—1—-2)/2 respectively. On the other hand, tT ~ oo ® 


O(p—1)/2- 


Proof. The first point is that the free Z,-module M) has rank p + 1. Indeed for 
each of the p + 1 lines ¢ through the origin in F%,, fix an element rg € R which 


spans £, and define a function fy, € M“ by 


2k = ¢ x 
o(t) ifr = tre witht € F, 


Ta ifr é 2. 


For fixed k the p+1 functions f¢ ; have pairwise disjoint supports and are therefore 
linearly independent over Z,,. Hence M“) has rank at least p + 1. But M has rank 
(p + 1)(p — 1)/2, so we deduce from (11) that M“ has rank exactly p + 1, as 
claimed. 

It follows that V“) has dimension p + 1 over F p- But an irreducible represen- 
tation of G over F,, has dimension < p, so V“) has a proper irreducible subspace 
and hence at least two irreducible constituents. On the other hand, V has exactly 
p — 1 irreducible constituents (Proposition 2), so we deduce from (12) that V“ has 
exactly two constituents. 

To identify these constituents up to isomorphism, we introduce a Z[G]-submodule 
Nz of M for 0 < k < (p — 3)/2. Givenm € M, let m : R — F, denote the 
reduction of m modulo p. We define Vj. C M to be the submodule consisting of all 
m such that m7 coincides with a binary homogeneous polynomial of degree 2k over 
F,. Strictly speaking, we should say “coincides with the function R — F’,, defined 
by” such a polynomial, but the distinction is moot: a homogeneous polynomial of 
degree < p which vanishes on R is zero. Thus the map m +» m determines an 
embedding of Ny /(Ni MO pM) into the space of o;. In fact this embedding is 
surjective and hence a G-isomorphism, because any even function R — F, can 
be lifted to an even function R > Zp. 

Now put Ae = EON, (0 < 1 < (p —3)/2). It is readily verified that if 
| ~ k then the image of Ne under m +> 7™ is {0}. On the other hand, we have 
just seen that the map m +> 77 gives a G-isomorphism of Nj. /(N%.M pM) onto the 
space of ox. It follows that the domain of this G-isomorphism can be replaced by 
NOE INO AN pM), But the latter can be viewed as a G-stable subspace W of 
Vv“), and the representation of G on W) is therefore equivalent to o;. Furthermore, 
we have seen that V“) has exactly two irreducible constituents, so the quotient 
V“ /W is also irreducible. Since its dimension is (p+ 1)—(2k +1) = p—2k, we 
deduce that the quotient representation is equivalent to 0(,—1—2,)/2. In summary, the 
representation of G on W and on V / W“ is equivalent to 0, and to 0(p—1—24)/2 
respectively. 

To see that 1 = oo ® o(p—1)/2, we observe that the set of indices k satisfying 
1<k < (p—3)/2is stable underk + (p—1—2k)/2. It follows that o9 and o(p—1)/2 
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occur as constituents of V“ if and only if k = 0. On the other hand, oo and 0(p-1)/2 
occur not merely as constituents but as subrepresentations of t (Proposition 4). It 
follows that they occur as subrepresentations of 1 whence t X ay @ O(p—1)/2- 
Finally, suppose that 1 < k < (p — 3)/2. If W is an irreducible subspace of 
V) then the representation of G on W is equivalent to an irreducible constituent of 
t), hence either to o; or to O(p—1—2k)/2- But if W A W) then the first possibility 
is excluded, because ox, occurs as a subrepresentation of t with multiplicity one 
(Proposition 4). As for the second possibility, it coincides with the first (and is 
therefore excluded when W 4 W“)) if k = (p — 1)/4. Otherwise it is excluded 
by Proposition 4 again, because 0()—1—2)/2 already occurs as a subrepresentation 
of r(P-!~24)/2), and the spaces V(?—!~24)/2) and V are linearly independent. We 
conclude that W“) is the unique irreducible subspace of V“), and since V“) has 
just two irreducible constituents it follows that V“) /W“) is the unique irreducible 
quotient. Oo 


2 The quadratic relations 


To move a step closer to U we turn from M to the Z[G]-submodule Q of M defined 
by the quadratic relations of Kubert and Lang. As before, our primary concern is the 
representation of G on the associated vector space over F,, which is now the space 


V' = Q/pQ. 


2.1 Preliminaries 


To define Q, recall that givenm € M we write m : R — F,, for the reduction of m 
modulo p. We will also let N denote the Z[G]-submodule of M consisting of all n 
for which 7 has the form 


A(r) = arj + brir, + cr; (13) 


with a,b,c € F,, where r = (1,12). Since N is a Z-form of the Z,[G]-module 
previously denoted Nj, it might be more logical to denote it Nj, but for simplicity 
we omit the subscript (and thereby void our previous use of N as a positive integer 
or as the subgroup of G corresponding to strictly upper triangular matrices). We 
define Q to consist of those m € M such that 


YS m(rya(r) = 0 (14) 


reR 


foralln e N. 
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It is immediate from this description that Q contains pM. Thus M/Q is a 
quotient of the finite-dimensional vector space V = M/pM over F>. In fact 
since Q is defined by the vanishing of three linearly independent linear forms on 
M/pM (namely those corresponding to the choices (a,b,c) = (1,0,0), (0, 1,0), 
and (0,0, 1) in (13) and (14)) we see that M/Q has dimension three over Fp. 
In particular Q has finite index in M, so by the Brauer—Nesbitt theorem, the 
representation t’ of G on the space V’ = Q/pQ has the same semisimplification 
as t. Thus Proposition 2 holds with t replaced by t’. This is not quite true of 
Proposition 5, however, and our next proposition gives an analogue of Proposition 5 
valid for t’. 


2.2 Homogeneous components 
Put O = Z, ®z Q. Then Q is stable under e“ (cf. (10), (13), and (14)). Hence 
o = ola 
with OQ” = e“Q, Thus putting yes QO) /pQ® we have 
Vv’ a aw (15) 
a decomposition of V’ into G-stable subspaces. Let t’ denote the representation 


of G onv’™, 


Proposition 6. Jf 1 < k < (p —5)/2 then v’ has a unique irreducible 
subrepresentation and a unique irreducible quotient representation, and they are 
equivalent to 0% and 0(p—1—2k)/2 respectively. On the other hand, JO ~ 00 B 


—3)/2 
O(p—1)/2 and 7! )/2) = 01 ® Op-3)/2. 


Proof. Suppose first that k 4 (p—3)/2. We claim that M“ C Q, whence M“ = 
oO). To see this, take m € M“) andn € N, and write 


Yimoacr) = >> YD mera), 


reR LEA reRne 


where A is the set of lines through the origin in F2. For each £ € A choose a vector 
re € R spanning £. Then the inner sum on the right-hand side can be written as a 
sum over t € he with r = tre. The homogeneity of m and 7 then gives 


Yimoyar) = Voim(roya(rd) YO k*?. 


reR leA tery 


Since k # (p — 3)/2 the exponent of ¢ on the right-hand side is < p — 1 and 
consequently the inner sum is 0. Thus M” © OandM” = OM) as claimed. 
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It follows that if k # (p — 3)/2 then z/“) ~ r(*), whence the assertions at hand 
reduce to those of Proposition 5. To handle the remaining case k = (p — 3)/2, 
we recall that t and t’ have isomorphic semisimplifications and are direct sums of 
their respective homogeneous components t? and t’ ® Since! ~ r+ fork x 
(p — 3)/2, we deduce that the semisimplifications of t/?~/?) and 1?-3)/2) are 
likewise isomorphic. Thus by Proposition 5, t’ (3/2) has exactly two irreducible 
constituents, namely 0(,—3)/2 and 0}. 

Now M and Q are also the direct sums of their homogeneous components 
MM“) and Q“, and we have seen that the vector space M/Q = M/Q has 
dimension three over F, (cf. (13) and (14)) while Mo = Q® fork # 
(p — 3)/2. Consequently M‘?—)/2) /Q\P—)/2) is also three-dimensional over F ,, 
as is therefore the subspace Y = pM ?~3/2)/pQ(?-3/2) of V"P-H/)_ Since 
7/9?) has just the two irreducible constituents 01 and 0()—3)/2 of dimensions 3 
and p — 2 respectively, we deduce that the representation of G on Y is o;. Thus 
0 is a subrepresentation of t’ (2-3/2) and O(p—3)/2 18 the corresponding quotient 
representation. 

It remains to see that o; is also a quotient representation of t , whence 
0(p—3)/2 1S a Subrepresentation and P31) 01 ® O(p—3)/2. To this end, consider 
the bilinear pairing < *,* >: O x N — Z given by 


((p—3)/2) 


<mn>= ~ Dmeyncr) (me Q, neN). 


reR 
Write L for the Z[G]-submodule of Q consisting of those m such that 
<m,n >= 0 (mod p) 


foralln € N.Put L = Z,@zL. Then L is stable under e“?, so putting £L = eL 
we have 


£= OPP? L, 


We claim that £“?-9)/?) contains pQ'?—>)/?) and that the quotient space Z = 
Q(P—3)/2) 1 £\(P—3)/2)_ of Vie? is of positive dimension < 3. An immediate 
consequence of the claim is that the representation of G on Z is equivalent to 0, so 
verifying the claim will complete the proof. 

It is immediate from the definitions that L contains pQ and hence that £ contains 
pQ. On the other hand, £ does not contain pM: for ifm € M is the function taking 
the value 1 on (+1, 0) and 0 elsewhere then < pm,n ># 0 mod p for any n € N 
satisfying (13) with a 0. It follows that for some k with 0 < k < (p —3)/2 we 
have pM“ ¢ £. But we have seen that pQ C Land that po” = pM for 
k # (p—3)/2. Hence £L°?—9)/”) does not contain pM‘?—>)/?), and we deduce that 
L(P-3)/2) | pQ(P-3)/2) ig a subspace of V"?— >”) of positive codimension. On the 
other hand, the codimension is < 3, because the subspace is defined by the vanishing 
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of three linear forms on V’"(?~9)/?), namely the forms sending m + pQ?->)/? to 
< m,n > modulo p with 7 as in (13) and (a, b,c) = (1,0, 0), (0, 1, 0), and (0, 0, 1). 
Our claim follows. oO 


3 The Kubert-Lang map 


Given a matrix y € SL(2,Z), we identify its image y € PSL(2,Z) with the 
fractional linear transformation of H defined by y. Thus if f is a function on H and 
y is the right-hand side of (6) then f oy is the function z+» f((az+b)/(cz+d)). 
Since the image of I'(p) in PSL(2, Z) has quotient G and fixes the elements of U 
we can make U into a Z[G]-module by setting 


g-f=foy™ 


for g € Gand f € U, where y € PSL(2, Z) is any lift of g. The resulting 
representation of G on the vector space V” = U/U? over F,, will be denoted 1”. 

Given a € p 'Z? ~ Z”, define the Siegel function g, as in (1). Forr € R we 
put f, = g!?, where a € p~'Z? is chosen so that r coincides with the residue class 
of pa modulo pZ?. Since a can be replaced by any element of the coset a + Z?, 
the function g!? is determined only up to multiplication by a pth root of unity ((6], 
p. 28, Formula K2), but the coset f,U” is uniquely determined by r because U? 
contains C*. Furthermore, if m € Q then the function 


? — I] yo) 


reR 


belongs to U (cf. (3) and (4), or in other words [6], p. 76, Theorem 5.2). Hence the 
assignment m + pQ + f’"U? defines an F,-linear map ®: V’ > V”. 


Proposition 7. The map ® is surjective with one-dimensional kernel, and it 
intertwines t' with tv". 


Proof. The argument echos the proof of Proposition 0 of [9], which in turn merely 
assembles a number of results from [6]. Let us at least recall the relevant citations: 
The surjectivity of ® follows from [6], p. 83, Theorem 1.3, because p is prime to 12 
and thus the map fU? +> f!?U? is an automorphism of U/U”. That the kernel 
of ® is one-dimensional follows from the surjectivity, because V’ has dimension 
(p? — 1)/2 over F, while V” has dimension (p? — 3)/2 ([6], p. 42, Theorem 3.2). 
Finally, the G-equivariance of ® follows from [6], p. 27, Formula K1. oO 


Put V" = ov’), so that 


MW __ a(P-3)/2 p(k) 
Vo =@p29 Vo. 


We write t”? for the representation of G on V"“), 
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Proposition 8. If 1 < k < (p —5)/2 then t”™ has a unique irreducible 
subrepresentation and a unique irreducible quotient representation, and they are 
equivalent to 0% and 0(p—1—2k)/2 respectively. On the other hand, HO x O(p—1)/2 


and g(P—-3)/2) = 01 ® Op-3)/2. 


Proof. Combine Propositions 6 and 7 and observe that V’ has exactly one G-stable 
subspace of dimension one. oO 


We conclude with some remarks which will lead to a slight reformulation of 
Proposition 8. Since p > 7, the two direct summands of 1” (2-9)/?) are inequivalent, 
so there is a unique subspace W”?~9)/?) of V”“?-9)/?) on which the representation 
of G is equivalent to 0(,—3)/2. We shall refer to the subspace 


vi" 


_ (@?v") @ wi P-3)/2) 
non —_ k=0 
of V” as the noncongruence part of V". The congruence part of V” is the unique 
subspace Vijyg of V" ((P~3)/2) on which the representation of G is equivalent to oy. 
Thus 

V" = V2, @ Vong (16) 


To explain the terminology, write & for the field 9’) of modular functions for 
I'(p), and given a subspace W of V” write Rw for the Kummer extension of & 
obtained by adjoining the pth roots of all f € U such that fU? € W. (Note that 
RP 1 U = U?, whence Gal(Ry /R) is dual to W under the Kummer pairing. In 
particular, [Rw : R] = |W].) Also put 


eons _ 'e mr) 


N21 


so that 2cong is the compositum of the modular function fields for all congruence 
subgroups of SL(2, Z). We claim that 


Ry M Mong = Ryy 


cong" 


(17) 


Together, (16) and (17) justify the designation “noncongruence part” for V,”... 

To prove (17), we recall from the proof of Proposition 6 that the subspace of 
yi )/?) on which G acts via 0; is PM/pQ (strictly speaking we should identify 
this subspace as pM (?—9)/?) / pQ?—)/2),_ not pM/pQ, but M” = OW fork # 
(p — 3)/2). Thus ®(pM/pQ) = Ving. It follows (see [9], Proposition 2, p. 12) 


that RV! 0 is the field of modular functions for [(p), whence the right-hand side of 
(17) is contained in the left-hand side. For the reverse inclusion, put 


l=ty €SLQ,Z): foy = f forall f € Ry” ON Meong}- 
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Then the field of modular functions for T’ is the left-hand side of (17). In particular, 
since the left-hand side of (17) is a subfield of Dcong it follows that I is a congruence 
subgroup. But the least common multiple of the cusp amplitudes of I" divides p?, 
because the field Ry” is generated over & by pth roots of elements of R. Thus the 
Wohlfahrt level of I divides p?, and since I is a congruence subgroup its Wohlfahrt 
level equals its congruence level by the Fricke—Wohlfahrt theorem [11]: [(p?) Cc I. 
Taking modular function fields of the two sides reverses the inclusion and thus gives 
the inclusion of the left-hand side of (17) in the right-hand side. 

Now put W” = Vv”) and for 1 < k < (p—5)/2 let W”™ be the unique 
irreducible subspace of V” Then the maximal semisimple subspace or socle of 
Vion iS eo.” wand we may describe its structure as follows: 


Proposition 9. The representation of G on the maximal semisimple subspace of 


V,",, is equivalent to ae ap. 
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1 Introduction 


The usual Weil height of a rational number x/y, where x and y are integers without 
a common prime factor, is defined as 


h(x/y) = log max(|x|,|y|). 


More generally, one can define the usual Weil height /(f) of an algebraic number 
f ina number field K by summing log max(|A|,, |1|) over all of the absolute values 
v of K. Mahler [Mah60] has proved that if F is a nonzero irreducible polynomial in 
Z\t] with coprime coefficients such that F(8) = 0, then 


1 
deg(F)h(B) = i log | F(e27'")|d0. (1.1) 
0 


The quantity ie log | F(e27')|d6 is often referred to as the Mahler measure of F. 
It is easy to see that h(B?) = 2h(B) for any algebraic number f. Similarly, it is 
easy to check that for any continuous function g on the unit circle, we have 


1 1 
/ g((e27'*)?)do = / g(er"!*) dé. 
0 0 


Furthermore, the unit circle is the Julia set of g : x > x?. Thus, Mahler’s formula 
says that one obtains the height of an algebraic number by integrating its minimal 
polynomial against the unique measure jz such that g,j4 = jy and jz is supported on 
the Julia set of @. 

Now let 9 : lee — Po be any nonconstant rational map. Brolin [Bro65] and 
Lyubich [Lyu83] have constructed a totally y-invariant probability measure jv, (that 
is, we have ~,j4 = j4) with support on the Julia set of g; Freire, Lopes, and Mafie 
[FLM83] have demonstrated that this measure is the unique totally g-invariant 
probability measure with support on the Julia set of g. When ¢ is defined over a 
number field K, Call and Silverman [CS93] have constructed a height function hy 
with the properties that (1) h,(y(x)) = (deg y)h,(x) and (2) there is a constant 
Cy such that |h(x) — hy(x)| < Cy for all x € P!(K). In [PST04], it is shown that 
Mahler’s formula (1.1) generalizes to the adelic formula 


(deg Fyhg(x) = 0 i log | Fly dite. (1.2) 


places v of K (Cy) 

where # is an algebraic point, F is a nonzero irreducible polynomial in Q[r] 
such that F(8) = 0, the measure jy, at an archimedean place is the totally g- 
invariant probability measure constructed by Brolin and Lyubich, and the integral 
Sore,) log |F|, dg, at a finite place v is defined so that its value is the v-adic 
analog of the value at an archimedean place (note that as defined in [PST04], these 
are not integrals per se). Favre and Rivera-Letelier have also given a proof of 1.2, 
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using actual integrals on Berkovich spaces; Pineiro [Pin05] and Chambert-Loir and 
Thuillier [CLT04, Thu06] have recently proved higher-dimensional generalizations 
of 1.2. 

Lyubich [Lyu83] has also proved that for any continuous function g and any 
archimedean place v, the integrals Spuc,) g djty, can be computed by averaging g 
on the periodic points of g; that is, 


1 
lim ———— g(w) = [ g dig». (1.3) 
k—roo (deg p)k dX PI(C,) . 


ok (w)=w 


Autissier [Aut01], Bilu [Bil97], Szpiro, Ullmo, and Zhang [SUZ97], and others have 
obtained generalizations and variations of this result. The most recent generaliza- 
tion, proved independently by Baker and Rumely [BR06], Chambert-Loir [CL06], 
and Favre and Rivera-Letelier [FRLO4] and [FRL06] states that (1.3) continues to 
hold when the periodic points w such that p(w) = w are replaced by the conjugates 
of any infinite nonrepeating sequence of algebraic points with height tending to 0 
and when the measure j1,,y is the totally y-invariant measure without point masses 
at classical points on the v-adic Berkovich space (see [Ber90]) for a finite place v. 

The function log|F|, for F a nonconstant polynomial, is not continuous in 
general, of course. Thus, the equidistribution results cited above do not allow us 
to compute Mahler measures by averaging log | F'|, over points of small height. One 
can, however, show that for any B € Q, we have 


1 ! 
(Q(B) : Qh) = tim =; > tog FEI = f tog iF), 4 
& 


n=] 


where F is a nonzero irreducible polynomial in Z[t] with coprime coefficients such 
that F(B) = 0 (see [EW99, Chapter 1], [Sch74]). Everest, Ward, and Ni Fhlathtin 
have proved similar results for maps that come from multiplication on an elliptic 
curve [EW99, Chapter 6], [EF96]. The proofs of these results make use of the 
theory of linear forms in logarithms [Bak75], [Dav95], which is used to show that 
the periodic points of the maps in question have strong Diophantine properties. It 
is not clear how to apply the theory of linear forms in logarithms in the case of 
more general rational maps. In this paper, we use Roth’s theorem [Rot55] from 
Diophantine approximation in place of the theory of linear forms in logarithms. 
This allows us to work in greater generality. 


1.1 Statements of the main theorems 


The main results of this paper extend (1.4) to a formula that holds for all rational 
maps. Let K be a number field or a function field of characteristic zero, let v be a 
place of K, and let g : Py —> PX be a nonconstant rational map of degree d > 1. 
We prove the following equidistribution result for the periodic points of 9. 
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Theorem 5.7. For any nonzero polynomial F with coefficients in K, we have 


. 1 
foucyioblFhdise= lim. ge Del Fone 
Ply) g* (wil)=[w:1] 

F(w)40 


This allows us to show that for any point 6 € K, the canonical height /,(8) can 
be computed by taking the average of the log of the absolute value of a minimal 
polynomial for 6 over the periodic points of g. 


Theorem 5.10. For any B € K and any nonzero irreducible F € K{t] such that 
F(B) = 0, we have 


(deg K) (deg F)(hg(B) — Ng(oo)) 


= LY tmz Y _ bsiFwh. 


places v of K ¢* (tw:1)=[w:1] 
F(w)40 


In both the theorems, the w are counted with multiplicity. We explain what 
multiplicity means in this context in Section 1. 

We are also able to prove that fue.) log |F |, dug, is the limit as n goes to 
infinity of the average of log | F'|, on the points w for which g”(w) = a, where a is 
an algebraic point that is not an exceptional point for g. We state this in Theorem 
5.6. This enables us to prove Theorem 5.9, which is the analog of Theorem 5.10 for 
the points w such that g”(w) = a. 

The strategy of the proof of the main theorems is fairly simple. By additivity, it 
suffices to prove our results for polynomials of the form F(t) = t — B for B € K. 
After Section 3, we are reduced to showing that 


a. _ logmax(|Px(B, 1)|». |Qx(B, Iv) 
We, 2, PEWS P= ee dk 
¢* (tw:1)=[w:1] 


Ww 


i log max(| Px (1, 0)|,, |Q« C1, 9) |») 
— lim : 
k—>oo dk 


(1.5) 
where ¢* is written as 
g* [To : Ti]) = [Pe (To, Ti) : Ox (To. T1)] 


for coprime homogeneous polynomials Py, and Q, in K[To, 7;]. The points w for 
which y*(w) = w are just the solutions to the equation Px (w, 1) — wOx(w, 1) = 0. 
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Thus, we get the left-hand side of (1.5) by taking the limit of log|P,(f, 1) 
— BOx(B, 1)|,/d* as k goes to oo. For each k, we rewrite this as 


Px(B.1) 
log |Qx(B, I)» log | Se? — pl 
dk + dk 


and use Diophantine approximation to show that the second term in the equation 
above usually goes to 0 as kK — oo; our theorems then follow after a bit of 
calculation. The Diophantine approximation result we use is Roth’s theorem, which 
we State in Section 4.3 as theorem 4.1. We use Roth’s theorem to derive Lemma 
5.2, which is the key lemma in our proofs of the main theorems. The idea for the 
proof of Lemma 5.2 comes from Siegel’s famous paper [Sie29]. We should note 
that after writing this paper we discovered that Silverman [Si193] has used methods 
very similar to those found here at the beginning of Section 5; we require a slight 
modification of his results along these lines, however, so we present the necessary 
argument here in full. 

Propositions 5.4 and 5.5 deal with the additional complications that may arise 
when the 6 in (1.5) is preperiodic. These complications are overcome with 
somewhat lengthy — but essentially basic — calculations that are very similar to some 
of the computations carried out by Morton and Silverman in [MS95]. 

In Section 6, we construct a simple counterexample that shows that Theorem 5.7 
will not hold in general when the polynomial F does not have algebraic coefficients 
(it is likely that the theorem will also fail if the point a is not algebraic). We construct 
a transcendental number f such that the limit limy—o F Dik ay log |& — B| does 
not exist. This means that there is no way to prove the main results of this paper 
without using some special properties of algebraic numbers. 
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C. Petsche, R. Rumely, and S. Zhang for many helpful conversations. In particular, we thank 
M. Baker, L. DeMarco, and R. Rumely for suggesting some of the applications mentioned in 
Section 7. The first author was partially supported by NSF Grant 0071921. The second author was 
partially supported by NSF Grant 0101636. 


2 Notation and terminology 


We fix the following notation: 


¢ K is a number field or a function field of characteristic 0 (by function field we 
mean a finite algebraic extension of a field of the form Keons(T.) where Kons is 
algebraically closed in K); 

e« visaplace of K; 

e K,, is the completion of K at v; 

¢ C, is the completion of an algebraic closure of K,, at v; 
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* K is the algebraic closure of K in C, (note that this means that v extends to all 
of K); 

° ny, = [K,: Q,]if K is a number field; 

e ny, = 1if K isa function field; 

¢ deg K =[K : Qlif K is a number field; 

e deg K = 1if K isa function field. 


We let | - |, be an absolute value on C,, corresponding to v. When K is a function 
field and z, generates the maximal prime M, in the local ring 0, corresponding to 
v, we specify that 


| yy = e Wov/My): Keons] 


where Kons 18 the field of constants in K. When K is a number field and v is 
nonarchimedean, we normalize | - |, so that 


—ny, 


Iply = Pp 


when v lies over p. When K is a number field and v is archimedean we normalize 
so that | - |, = |-|”" on Q, where | - | is the usual archimedean absolute value on Q. 

Throughout this paper, we will work with a nonconstant morphism ¢ : PL o> 
Pk of degree d > 1. We choose homogeneous polynomials P,Q € K[To, Ti] of 
degree d without a common factor along with a coordinate system [s : t] for PE 
such that 


g([To : Ti]) = [P(T0. M1) : Q(T, T1)I, 
where P and Q have no common zero in P!(K). We let P} = P and Q; = Q, and 
fork > 2 we define P;, and Q; recursively by 


Px(To, T1) = Pe-1(P (To, T1), Q(To, T1)) 


and 
Ox(To, T1) = Ox-1(P (Zo. T1), O(T0, 1). 


Having chosen coordinates, we can define the usual Weil height as 


1 
h(a: 3) = aoe Pe max(|a|,, |b|») 


when a,b € K. When a and b lie in an extension L of K, this definition extends to 


1 


WG OS Rides 


[Ly : Ky] log max(|a|y, |b|w), (2.1) 


places w of L 


where L,, is the completion of L at w and the absolute value | - |,, restricts to some 
|-|,on K. 
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As in [CS93], we define the canonical height hy as 


h(y* ([a : b))) 
ae (2.2) 


he({a : b)) = lim 


We say that a € P!(K) is a periodic point for ¢ if there exists a positive integer 
n such that y” (a) = a. If @ is periodic, we define the period of a to be the smallest 
positive integer £ such that y’(@) = a. We say that a is perperiodic if there exists a 
positive integer n such that p”(q) is periodic. 

We will use a small amount of the theory of dynamics on the projective plane; for 
a more thorough account of the subject, we refer the reader to Milnor’s [Mil99] and 
Beardon’s [Bea91] books on the subject. We say that a € P!(K) is an exceptional 
point for g if y?(~) = @ and g” is totally ramified at a. This is equivalent to saying 
that the set Sar (a) is finite (see [Bea91, Chapter 4.1]). If @ is exceptional, 
then at each place v, there is a maximal v-adically open set U/ containing a such that 
the sequence (y“*(B)); converges to a for each B € U, where £ is the period of a 
(which is either 1 or 2). We call U the attracting basin of a (see [Bea91, Chapter 
6.3], which uses the terminology “local basin’). 

We always count points with multiplicities in this paper. The multiplicity of a 
point [z : 1] in the multiset {w | y*(w) = w} is the highest power of ¢ — z that 
divides the polynomial P; (¢, 1) — tQ;(t, 1). The multiplicity of a point [z : 1] in 
the multiset {w | g“(w) = [s : u]} is the highest power of t — z that divides the 
polynomial uP; (t, 1) — sO;(t, 1) (here s, u, and z are taken to be elements of K, 
while ¢ is taken to be a variable). 

We note that everything done in this paper depends upon our choice of coordi- 
nates. In particular, our integrals are closely related to the canonical local heights 
(see [CG97]) for the point [1 : 0] at infinitely, so our choice of the point at infinity 
affects all of our integrals. To emphasize the fact that we treat [1 : 0] as the point at 
infinity, we denote it as oo where appropriate. 


3 Brolin—Lyubich integrals and local heights 


We will work with the limits 


_ logmax(|Px(a, b)|,, |Ox(a, d)|,) 
lim 
k->oo dk 


(3.1) 


for (a,b) € C, \ {(0, 0)}. For a proof that these limits exist, see [PST04], [BRO6], or 
[CG97] (the proof is essentially an exercise in using telescoping sums and geometric 
series). Note that Call and Goldstine [CG97, Theorem 3.1] have shown that 


log max(| Px (B, 1) |v. |Ox«(B, Dl) 
d* 


ho» ((B : W) = Jim, 
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is the unique Weil function for [1 : 0] at v (see [Lan83, Chapter 10] for a definition 
of Weil functions) that satisfies 


how (p([a : b))) = dhp,{fa : b)) + log|O (5.1) . 


for any [a : b] ¥ [1 : 0] (see [CG97, Theorem 2.1]). The function hov() is called a 
canonical local height for g. 

As noted in the introduction, Brolin [Bro65] and Lyubich [Lyu83] have con- 
structed a totally g-invariant measure j1,,, with support on the Julia set of ¢, when 
v is an infinite place (see also [FLM83]). More recently, Baker and Rumely [BRO6], 
Chambert-Loir [CLO6], and Favre and Rivera-Letelier [FRLO4] and [FRL06] have 
constructed a g-invariant measure jy , on the Berkovich space associated to P!(C,); 
this is the unique g-invariant measure without point masses at classical points on the 
v-adic Berkovich space (see [Ber90]) for a finite place v. 


Proposition 3.1. Let v be a place of an algebraically closed field C, that is 
complete with respect to v, and let F(t) = t — B for B € C,. Then 


log |Flyditey = lim LOB MAX Pe(B, Dl, 1QeB, Yh) 
P'(C,) k—-o0o 
l Px 1, vo C 1,0 v 
7 og max(| Px (1, 0)|,, |Ox( bv) ais 
k—>0oo dk 


Proof. We will prove this following the methods of Baker and Rumely [BR06, 
BRIO]. The proposition could also be proved using the work of Favre and Rivera- 
Letelier [FRLO6] or Chambert-Loir and Thuillier [CLT04,Thu06], who proved more 
general Mahler formulas (but do not formulate them in terms of limits such as (3.1)). 
In [BR06], Baker and Rumely show that for w € C,, the function H,, defined by 


log max(| Px (1, 0)|,, |Q«(1, 0)|,) 
dk 


Hi, ({a : b]) = —log|wb — a], + lim 
k—o0o 


is subharmonic on P!(C,) \ {[w : 1]}. Furthermore, if A is a suitably normalized 
distributional Laplacian (i.e., a suitable multiple of —dd° considered in the distribu- 
tional sense, which can be extended to the setting of Berkovich spaces as described 
in [BR10]), then 


AA, = —Hov +e bw (3.3) 
where 6,, is the usual Dirac point mass at w. Similarly, we have 
A log |t — Bl, = 51:0) — 5g 
(see [FRLO6, Section 5.1] or the same reasoning that gives (3.3)). Now, since 


log |t — B|, and Hy, are both subharmonic on P!(C,) \ {[1 : 0], [w : 1],[B : 1}, 
we have 
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/ gl’ Blidiive = (/ log |t — Bly (-an,)) ii5g =, 
PI(C,) PI(C,) 


= (/ Hy (—A log |t - Bi) + log |w— Bly 
PI(C,) 


= A, ([B : 1]) — Ay([1 : 0]) + loglw— Bly, G.4) 


by the self-adjoint property of A (which follows from [BR10, Proposition 5.28] 
when v is nonarchimedean and is simply integration by parts when v is archimedean). 
Thus, we have 


log max(| Px (B, I)|v, |Ox(B, Vv) 


—log|w— Bly + lim TE + log |1| 
1 P,(1, 0)I,, 1,0)|, 
— jim SMUT Vb 1OeA OW iGo jy — Bh, 
k->0o dk 
ij log max(| P (B, |v, |Ox(B, Dv) 
= lim 
k—oo dk 
fim LOSMaX(| Pe(1, O)ly-1 Qe. Oh) 
— lim : 
k->0oo dk 
as desired. oO 


Note that although our integrals are defined for points in C,, the results we prove in 
Section 5 only apply to points in K, where K is a number field or function field of 
characteristic 0. 


4 Preliminaries from Diophantine approximation 


The following well-known theorem of Roth [Rot55] is the principal tool from 
Diophantine approximation that is used in this paper. 


Theorem 4.1. (Roth). Ifa € C is algebraic over Q, then for any € > 0, there is a 
constant C such that 
a | C 


a--|>—_, 
| b |b[2+¢ 


for alla/b € Q such that a/b # a. 


We will need to work in slightly greater generality. In the terminology of the 
previous section, Roth’s theorem admits the following generalization (see [Lan83, 
Theorem 7.1.1]), which holds when K is number field or a function field of 
characteristic 0. 
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Theorem 4.2. Let a ,,..., 0, be elements of K and let L C K be a finite extension 
of K. Then, for any € > 0 and any places v of K and w of L such that w|v, we have 


1 


IL: Klee K) S > max(0, —log ja; — Bll) < (2+ eh(B) + O(), 


i=l 


forall B € L not in the set {a,..., Qn}. 


Let [a : 1] be a point in P!(K). Then for any [b : 1] 4 [a : 1] in P'(C,), we let 
Ata:},v((b : 1]) = max(— log |b — al, 0). 
We extend this definition to the point at [1 : 0] by letting 
Ma:tjv(f1 : O)/) = 0 


and 


Ati:ov([b : 1]) = max(0, log |b],). (4.1) 


We will work with divisors on PE rather than elements of K. Let D = 
y=, mia, where a; € P'(K) and m; € Z. We let 


Ap y(B) = D> mide v(B) 


for points B € P!(C,) that are not in Supp D. Then Ap, is a Weil function for D 
at v as defined in [Lan83, Chapter 10]. It is easy to check that for any divisor D and 
any rational map ¢ on P', we have 


Apv(Q(B)) = Agx p(B) + OW), (4.2) 
for all 8 € P!(K) away from the support of D and y* D. This is a general functorial 
property of Weil functions, as explained in [Lan83, Chapter 10]. 

For a divisor D = )~/_, mja;, where a; € P'(K), we define 


r(D) = max(m;). 


With this terminology, it follows from Theorem 4.2 that for any « > 0, any finite 
extension L of K, and any positive divisor D on P!(K) with r(D) = 1, we have 


IL: K](deg Ky?” < (2+ €)h(B) + O(1) 
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for all 8 ¢ P'(L) away from the support of D. Hence, for any positive divisor D 
we have 


1 


IL: K](deg Ky??? <r(D)(2 + €)h(B) + O(1). (4.3) 


5 Main results 


We begin with a simple lemma on how r((gy”)* (D) behaves as n — co when D is 
a divisor that does not contain an exceptional point of g. We recall that in general 
if D= er m;q; is a divisor on P! and w : P! —> P' is a nonconstant rational 
map, then 


wD= > SY) meelB;/0i)Bi (5.1) 


i=] W(B)) =a; 
where e(f; /a;) is the ramification index of y at f;. 


Lemma 5.1. Let D be a divisor such that Supp D does not contain any exceptional 
points of p. Then limg-+o0 roy" D) =0. 

Proof. Recall that w is an exceptional point if and only if g?(a@) = a and ¢ is 
totally ramified at both aw and g(a). Since g has at most two totally ramified points, 
it follows that if w is not exceptional, then one of a, g(a), and g(a) is not a totally 
ramified point of g. Since the degree of gy? is d*, this means that for any divisor 
E such that Supp E does not contain an exceptional point, we have r((y°)* E) < 
d*r(E) (by (5.1)), so r((y?)*E) < d?(d — 1)r(E) Now, since Supp D does not 
contain an exceptional point, Supp(y*)* D does not contain an exceptional point 


for any &. Thus, for any k > 3, we see that re") is less than or equal to ((d — 
1)/d)*—?/77(D), which goes to zero as k goes to infinity. o 


5.1 Using Roth’s theorem 


Roth’s Theorem allows us to prove the following lemma. The idea of the proof 
is that if p*+‘(B) approximates D very closely, then gy (£) approximates (y")* D 
very closely. Since g*(f) has height approximately equal to 1/d‘ times the height 
of g*t*(B), this makes h(y*(B)) small relative to A(t) p(B). Repeating this for 
infinitely many gy‘ (f) gives a contradiction to Roth’s theorem. This idea is due to 
Siegel [Sie29]; similar arguments can be found in [Sil93]. 
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Lemma 5.2. Let D be a positive divisor on P! such that Supp D does not contain 
any of the exceptional points of ~. Let B be a point in P'(K) for which there is a 
strictly increasing sequence of integers (e;)°2, such that p% (B) ¢ Supp D. Then 


i=l 


Adv(y* 
i Adw(G" (B)) _ 0. (5.2) 
i->oo dé 
Proof. Let L be a finite extension of K for which 8 € P!(L). Choose 5 > 0. By 
Lemma 5.1, we may pick an integer £ such that ry) 
rg" D+) 
d 


< 6/2. We may then write 


= 8 for some € > 0. For any e;, we have y—"(B) ¢ Supp(y)* D 
since p(B) ¢ Supp D. Thus, applying Roth’s Theorem (as expressed in (4.3)), we 
find that for all e; we have 


1 


[Ls K]deg Kono" B) < r((g')* D)(2 + e)h(p*“(B)) + OC). 


Using (4.2) and the fact that h(y* (B)) < d‘h,(y*—"(B)) + O(1), we then obtain 


1 


[L : K](deg K) p(y" *(B)) + OC) 


Apwv(y" (B)) < IL: K](deg Ky 


< r((y)*D)(2 + Oh(y—“(B)) + O11) 


£ *D)(2 
— MEY MEF 1g (B)) + OW) 


< 6h(y" (B)) + OU) 
< bd%h(B) + O(A). 


Dividing through by d“ gives 


i 1 dv(p% (B)) 
a 


<[L: K](deg K)éh(B). 


Since Ap y(g% (B)) = 0, letting 5 go to zero gives (5.2), as desired. Oo 


This allows us to prove the following proposition, which will be used to prove 
Theorems 5.6 and 5.7. 


Proposition 5.3. Leta = [s : u] be a nonexceptional point in P!(K). Then for any 
point B = [a : b] in P'(K) and any strictly increasing sequence of integers (e;)~, 
such that p(B) # a, we have 


i log |uPe; (a,b) — sQe,(a,b)|y _ ,, log max(| Pe; (a, b)|y, | Qe; (a, b)|y) 
im = lim ; 


i->oo dé i-oo dé 


Proof. Note that we know that the limit on the right-hand side of the equation above 
exists by the discussion at the beginning of Section 3. 


Equidistribution and generalized Mahler measures 621 


If [1 : O] is an exceptional point of g, let U/ be its attracting basin; if [1 : 0] 
is not exceptional, let / simply equal {[1 : O]}. We will divide (e;)%2, into two 
subsequences: one consisting of the e; for which g%(6) ¢ U and one consisting 
of the remaining integers in the sequence (e; Jim Let (¢; a1 be the subsequence 
consisting of all integers ¢; in (e;)?2, such that v' (B) € U (this subsequence may 
be empty). We have 


4 max(log| Pr, (a,)/ Qu, (a.b)\v.0) _ 


jroo dti 


(5.3) 


If [1 : O] is not exceptional, this follows from Lemma 5.2 applied to D = [1 : OJ, 
along with (4.1). If [1 : 0] is exceptional, the fact that p“/ (8) ¢ U for all j implies 
that | Pe; (a,b)/Qze; (a, b)|, is bounded for all 7, so (5.3) clearly holds. It follows 
immediately from (5.3) that 


__ logmax(| Pe, (a, )|v, |Qze, (a, b) |v) _ log|Qz,(a,5)|v 
lim = lim ————. 


joo d‘i joo d‘i 


(5.4) 


Note that if wu = 0, then 
uP¢, (a,b) — sQ¢, (a,b) = sQz, (a,b), 


so we are done. Otherwise, by Lemma 4.2, we have 


Po; (a,b) ‘ 
max (0,—tog Oi, Dah =e ) 
lim = 0. 
joo d‘i 
Combining this with (5.3), we see that 
Pu; (a,b) ‘ 
os | aaa) ~ ul, 
lim *=0. 
joo di 


Thus, using (5.4), we obtain 
= log |uPe; (a,b) — sQ¢, (a, b)|v 


joo d‘i 
Pe; (a,b) . 
toe (IO, (a,Dylsuly| pene — ) 
= lim i 
jroo hal 
1 Po; (a,b) 5 

a Ai ee 7 8) Oj @b) ul, 
ki lim : 
joo a es dati 
fee Gres 

lim - - : 
joo di 


as desired. 
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Now let (™m ;)°2 42, be the subsequence of (e;)72, consisting of all integers mj; in 
(e;)?2, such that oi (B) € U (this subsequence may also be empty). If a = [1 : OJ, 
then [1 : 0] is not exceptional by assumption, so there are no m; and we are done. 
Otherwise, we have 


|sQm; (a, b)\y 


im ———_ = 
joo |UPm (a, b)\v 


Pin ( 


since Om AC “ 


goes to infinity and u ~ 0. This implies that 


_ log |uPm; (a,b) — 5Qm; (a, b)|v 
lim 
joo di 
_ log |uPm, (a, 5)|v 
= lim — —2— 
joo d™i 


= log max(| Pn; (4, b)\v, |Qm; (a, b)\y yy 


j7oo di 


Since every element of the sequence (e;)°2, is in (¢ Dea or (m pyen this 
completes our proof. oO 


5.2 Preperiodic points 


Proposition 5.3 provides all the information we need when g*({a : b]) = [s 
u] for at most finitely many k; this will always be the case when [s : u] is not 
periodic. When [s : u] is periodic, however, there may be infinitely many k such 
that y*({a : b]) = [s : u]. New complications arise when this is the case; we treat 
these complications in Propositions 5.4 and 5.5. 

Suppose that (bT) — aT)" is the highest power of (bT) — a7\) that divides 
uP; (To, T\) - SOK(To, T\) in K[T, T\]. We write 


uP; (To, T) — SOx (To, T1) = (BT — aT,)"* Gx (To, T1), 


where Gy, is a polynomial in K[To, T;] such that G; (a,b) 4 0. 


Proposition 5.4. Let [s : u] be a nonexceptional point of g. Then, with notation as 
above, we have 


i log |G; (a, b)|v _ logmax(| Px (a, b)|y, |Qx(a, byl) 
im ae = = lim 


k—>oo k-oo dk 


(5.5) 


Proof. By Proposition 5.3, equation (5.5) holds if we restrict to the k for which 
g*({a : bl) # a. If there are only finitely many k such that g*([a : b]) = a, 
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we are therefore done. Otherwise, let 7 be the smallest positive integer such that 
gv! ({B : 1]) = w and let ¢ be the period of w. Then g*([B : 1]) = @ precisely when 
k is of the form j + mé for some integer m > 0. If v'([s : u]) = [s : ul], then 
uTy — sT, divides uPe(To, T1) — sQe(To, T1). 

Suppose that uv # 0. Then, expanding Q, in the variables u7y — sT; and T;, we 
see that since u7y — sT, cannot divide O¢(To, T,) (because if it did, then it would 
also divide Pe(To, T;) and we know that Q¢ and P¢ have no factors), we have 


£ 
Ov(To, Ti) = goT? + (uTo — 8T,)W(To, T1) 


for some nonzero go € K and some W(Tp, T\) € K[To, T,|. For any m > 1 we 
thus have 


£ 
Ome = g0(Oum—10)" + (uPon—1)e = SQ (n—1)0)W(Pon-1e. Q(m—1)e)- 


Using induction, we see then that 


Dido a" ame 
Ome(To, T1) = 85 Tf + (uTo — 8T,)Wi (To. T1), (5.6) 
for some polynomial W,,(To, 71) € K[To, Ti]. Similarly, we may write 
uPe(To, T1) — sQe(To, T1) 
= (uly —sT1) f, TE + UT — 5T1)"*'V (To, Th), (5.7) 


for some nonzero f, € K, some integer r > 0, and some V(7p, 7) in K[To, 71]. 
Since [s : u] is not an exceptional point of g, we have r < d° (note that if r 
were equal to d‘, then g would have to ramify totally at g([s : u]),....9°[s : ul], 
which would imply that € = 2 and that [s : u] is therefore an exceptional point, as 
explained in Section 2). Then for any m, we have 


uPine — SOme = (uP(m—1)e = SOm-—ne)’ OG. ye 
- (Pon—1e — SQm—1e)’t'V(Pon—ve, O(m—1)e), 
so, using (5.6), (5.7), and induction, we obtain 


uP ne(To, T1) 7 SOme(To, T\) 


m Sl mt MTN git—pi 
= (uTy = sT,)" prime Te e T grime (d ) 
+ (uTo — 8T1)" t! Zn(To, T1), (5.8) 
for Zm a polynomial in K[Tp, T;]. Since r < d°, we have 


veo a 
lim Yo a log | go ly _ log |golv 
m—>oo qint moo qint d’—-1- 


ml oi mle sie i 
i= j=o (d"*— 
log | fr" gy | 
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Now, let € be the highest power of a7) — bT, that divides uP; — sQ;. Using 
(5.8), we see that we have 


uPj4+me(To, T\) = SQ j4me(To. T;) = (bTo _ aT)" Gj+me(To, Ti) 
for a polynomial Gj+me € K[To, T\]. Letting m go to infinity, we see from (5.8) 


that 


lim log |Gj4me(a,b)|y log |goly _ los lQi@, by 
FiseS qitmt — di(d'—1) di ; 


Similarly, (5.6) yields 


log |O j4me(a, b)|y log |goly log|Q;(a,b)|y 


mi=>00 ditmt ~ di(d’—1 di 


Moreover, since uPj4me(a,b) = SQ j4me(a, b) for every m, we have 


log | Pj+me(a, b)|y 


] j +m ,D v 
fit = tim O8lQitmel@, Diy 


m—>oo di +mé m—>oo qditme 
Hence 
ha log|Gj+me(a,b)|y _ lim log max(| Pj +me(4, b)|v. |Q j+me(a, b)Iv) 
m—>oo qitme ~~ m—00 qitme : 


which completes our proof in the case u # 0. The proof in the case u = 0 proceeds 
in exactly the same way, using 7p in place of 7). oO 


We have a similar result for the polynomials Ty P, — T; Ox. We write 
Ty Pe (To, T1) — T; Ox(To, Ti) = (bT) — aT)" Ay (To, Ti) 


where H; is a polynomial in K[7o,7;] such that Hy(a,b) 4 0. The proof of 
the following proposition is similar to Morton’s and Silverman’s proof of [MS95, 
Lemma 3.4], but it requires a bit more detail since it yields information about 
Hi, (a,b) as well as ng. 


Proposition 5.5. With notation as above, we have 


fim OS MAX Pe, b)lv. |Qe(@, Oly) 
— 1m a 


lim 
k->oo dk 


k—oo 


log | Hk (a, b)|v 
ie (5.9) 
Furthermore, nx remains bounded as k goes to infinity. 


Proof. If (e;)?2, is a strictly increasing sequence of integers such that p* ([a : b]) 4 


[a : b] for each e;, then 


He; (To, T1) = To Pe; (To, T1) — T1 Qe; (To, T1) 
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for all e;. Hence, by Proposition 5.3, we have 


_ log |He,(4,5)|y _ |, log max(| Pe; (a, b)|v, | Qe; (a, b)|v) 
lim ————————_ = _ lim : 


i—0oo dé i30oo dé 


If [a : b] is not periodic, this finishes the proof. Thus, we may assume that [a : ] is 
periodic. The rest of the proof is a computation. We divide it into three steps. 


Step I. We begin by changing variables so that [a : b] becomes [0 : 1]. If b = 0, we 
write Up = T,/a and U; = —To. We then let 


1 
R(Uo, U1) = 7 Po T;) 
and 


S(Uo, Ui) = —P(To, Ti) 


(this is simply the inverse of the transformation we defined on 7p and 7); our change 
of variables is obtained by conjugation by a change-of-basis matrix). If b 4 0, we 
write U; = ae and 


Up = bTy — aT. 
We then let S(Up, Uj) = O(To, T;)/b and 
R(Uo, U1) = bP(T, T1) — aQ (To, T1). 
We define R;, and S; recursively by letting R; = R, S; = S, and setting 
Rx41(Uo, Ui) = Re(R(Uo, Ui), S(Uo, Ui) 


and 
Sx+1(Uo, U1) = Sk(R(Uo, U1), S(Uo, U1). 
By the construction of our change of variables, we have 


U, Re (Uo, U1) — US (Uo, Ui) = To Pe (To, T1) — T1 Ox (To, T1) (5.10) 


as polynomials in Ty and 7;. Hence, if Uj“ is the highest power of Up that divides 


U; Re (Uo, Ui) — UpSk(Uo, U1) and t; is the coefficient of the Up" udian term in 
U, R,(Uo, U1) _ Up SK (Uo, U;), then 


Tk = Ay (a, b). 


Now let £ be the smallest positive integer for which g“({a : b]) = [a : b]. Note 
that |Sme(1,0)|y = anebb if b A 0 and 


|Sme(1, 0)|, = |Pme(a, b)|y/|aly 
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otherwise. Since 
[Pine(a, b) : Ome(a, b)| = la : b] 


for every m, it follows that 


log | Sine(O, 1)|, F log max(| Pine(a, b)ly, |Ome(a, b)|,) 
lim ——\——— = lim : 


m—>oo qm m—>oo qint 


Thus, it will suffice to show that 


: log ITimely ‘ log | Sine(O, 1)|, 
ao ge me gee vie 
We write 
dé 
F tj 
Re(Uo, U1) =). fiUjUP 
i=l 
(note that Up divides Ry by our change of variables) and 
dt 
‘ vs 
Se(Uo, Ui) = >> gi Ujue. 
i=0 
Using induction, we see that 
ae dit —m me 
Rne(Uo, U;) = mg 2 j=0 : UwUe ‘1 (mod Uo) 
and 
Sme(Uo, U\)= = gm =a!" uem (mod U3). 
Thus, we have 
U; sakes o — UpSime(Uo, U1) 
go m me 
= 5° Gi /g0)" —1)UpUf” (mod UG). (5.12) 


Step II. We will now treat the m for which (fi; /g0)"" 4 1. We have 


[log |(fi/g0)” — Ivy < AC fi/80)”" — 1) S 2m[K(fi/g0) : K\A(fi/80) 


for all m such that (fi/go)” #4 1 (this is a simple version of Liouville’s 
theorem), so 


log |(fi/0)” — lly 


lim mi 


mo 
(fi/g0)"Al 


=0. 
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Thus, dividing (5.12) through by Uo, we obtain 


rrnp alt 
fm LOeltmely _ 4 log lgo’ Wy _ Hog |Sme(O, Dy 
m—>oo qm Sia qime Psa qi : 
(fi/g0)"Al 
as desired. 


Step III. We are left with treating the m for which (f1/go)” = 1. Let p be the 
smallest positive integer m such that (f1/go)’” = 1 and write w = pt. For g > 1 
we write 
I” 
Rent => uo 
i=l 


(the summation starts at 1 since Up divides Ry.) and 


die 
Sqw (Uo, U;) = a, yl Uj yee 
i=0 


Since i; = 8 by assumption, we have yll = xl by (5.12). Multiplying R,, 
and S,, through by a constant will change all of the limits we are calculating by the 
same fixed amount, so we may assume that yl = = xl = 1. Let r be the smallest 


integer greater than 0 such that xl x yl (we have r > 2 since (f,/go)” = 1). 
Then Uj divides U; Ry — UoS., which in turn divides U; Ra — UoSga for any q; 


hence U; divides U; Roo -UnSra for every g, so a= = ye, for 7 <r. To calculate 


xi _ yi, we introduce some notation: we let 


M 
(> 1; Up um) =t; 
i=0 


J 


for any polynomial ).™., ;UiU~. We have 


-1 i @-No_j 
osha (mau), 


i=1 


r-1 
= Sry (Ry (SM) (5.13) 
j=0 


For any i <r, we have xl! = = yl, so (Up Rw); = (Ui; Sy);. Hence, we have 


((Ro)! (Sw ye Do_ +) _ (Ry HG jr) 
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for j > 0. For 7 = 0, we have 


(se) = (Ro + Gl — yl usu) se") 


r 
—)o_ 
= (RSJ) + Gly), 


since yl = xl] = 1, 


Using equation (5.13), we see that 


x yl = > xP (Ry 2) ) 


i=1 


r 


r-1 
= . (q-lho_;_ 1 —1 
— Sol (RoHS MI) + GP — ELD) 
j=0 


= - r (q—-No_, 
+ (x — yl) (Ra) So) 
= (xl = yl jade) a (x4 = yest ; 


We have ye" = xa = |, since yl! = xl = |. Thus, assuming inductively 


that 
= —l1 1 
af Noy = GG ys 
we have 
xlfl — yl) = g(xlll — yl). (5.14) 


Note in particular that n,.. = r for all g, so nx is bounded for all k, as desired. 
Now 


1 
log lg(xt! — yyy _ 


a q12 
and Tyo, = xia yf. Since S,..(1, 0) is simply ye = |, we have 
log |Tga|y ~ log |Sga(0, 1)], 
_ £ | Te 4h =. & | Sqeo( Jv 
q>oo |= 1” q->0o dq 


which gives us (5.11) and thus completes our proof. Oo 
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5.3 Proofs of the main theorems 


Now we can show that the integral ric, log |f — B|vdjzy can be computed by 
taking the limit of the average of log |B — w|, on the points in g~* (a), as k > 00, 
for any nonexceptional point a. 


Theorem 5.6. Let a = [s : u] be a nonexceptional point in P!(K). Then for any 
nonzero polynomial F(t) € K|t] we have 


1 
1! F yd y= i — ] F eo 
= 08) Fly dpa Nn oe dS log |FOw)| 


o* (wil) =o 
F(w)40 
where the [w : 1] for which g* ([w : 1]) = @ are counted with multiplicity. 


Proof. The polynomial F' factors as F(t) = y TTje,(t — Bi), where y and 
Bi,..-, Bn are elements of K. For each f;, the multiplicity of B; in (y")*« is at most 
r((p*)*a) (where r((y*)*«) is defined as in Section 4). Since @ is not exceptional, 


we have limz-+o9 rw ya) = 0, by Lemma 5.1. Thus, 


ee al | 
sige Me eee Male 
og ((wil|)=a ¢* ((wilf)=a 
w#Bj F(w)40 


for each B j- Hence, it suffices to show that 


g.. A 
[-.)!98l —Bleditew = him, Fe y log |w— Bl, (5.15) 


o* (wil)=a 
we 


for any B € K. 
Note that g*([w : 1]) = [s : u] if and only if uP (w, 1) — sOx(w, 1) = 0. Thus, 
as polynomials in t, we have 


uP(t,1)—sOt.=m IT] @-). 


of ([wi1)) =[szu] 
where n;. € K. We write 
uP;(t, 1) —sQOx(t, 1) = (t — B)"* Get, I) 


for a polynomial G; such that G; (6, 1) 4 0, as in Proposition 5.4. Note that 


Gt =m [] @-w). 


o* (fw:l) =a 
w#B 
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Plugging f in for ¢ and taking logs of absolute values gives 


log |Gx(B. Vly =logInly+ >>  loglw—Bly. (5.16) 
o (fw: =[s:u] 
w#B 
Applying Proposition 4.4 therefore yields 


log |nxly 
dk 


1 
lim —; ~ log|w — Bly + 


o* (wil) =a 
w#B 


— jim Loemax Pe (B, Div. 1Qe(B, Db) 
= um 7 
k—->0oo dk 


(5.17) 
Now, writing 


uP; (To, Ti) — sQx (To, T1) = T,* Ve(To, Ti) 


for some polynomial V;, such that V; (1,0) 4 0, we see that yn, = V.(1, 0). Applying 
Proposition 4.4, we obtain 


_ log|nely — ,. log max(|Px(1, 0)|v, |Qx (1, Oly) 
lim ——— = lim : 
k—->0o d k—>0o d* 


Substituting this equality into (5.17) gives 


_ i . logmax(| Px (B, Iv. |Ox(B, Dv) 
we ? log|w— Bly = jim 


k—>00 d* 
o* (fw:l)) =a 
[w:l]#B 
. logmax(|Px(1, 0)|,, |Qx(1, 9)|,) 
— lim : 
k—>oo dk 
(5.18) 
Using Proposition 3.1, we obtain (5.15). oO 


Now we show that the same result holds when we average log|6 — wl, over 
periodic points rather than inverse images of a point. 


Theorem 5.7. For any any polynomial F € K{t] we have 


1 
log|F|,dug, = lim — log|F We 
[i .,!9elFhaties = lim ae les FO 


¢* (bw:1)=[w:1] 
F(w)40 


where the [w : 1] for which g* ([w : 1]) = w are counted with multiplicity. 
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Proof. As in the proof of Theorem 5.6, it will suffice to show that 


1 
log|t — Bly dig» = lim ae log |w— Bly 5.19 
ee oglt—Blvdue = lim =D) loglw— Bly (5.19) 


¢* (wil) =[w:1] 
w 


for any 8 € K (this follows from the fact that the multiplicity of each f; as a 
k-periodic point is bounded for all k by Proposition 5.5). 
We have y* ([w : 1]) = [w: 1] if and only if Py (w, 1) — wOx(w, 1) = 0. Thus, 


POW -tOc¢ =r [] @-w), 


¢* (wil) =pv:1] 
for some yz € K. We write 
Px(t, 1) = tOx(t, 1) = (t = By" A(t, 1) 


for a polynomial H;, such that H;,(6, 1) 4 0. We have 


A(t. =v YT] @-w). 


¢* (wil)=[w:1] 
w#B 


Then, plugging f in for ¢, taking logs of absolute values, and applying Proposition 
4.5 gives 


log |Yelv 


. 1 
lim rT > log |B —wly + 7k 
¢* ((w:1)=[w:1] 

FB 


w 


— im loemaxd Pi (B, Diy 1Qe(B, Dlv) 
= lim : 


Jim, rT (5.20) 


Writing 
T; Pe(To, T1) — ToQx (To, T1) = T;"* Wi (To, T1) 
for a polynomial W; such that W.(1,0) # 0, we see that y, = W,(1,0). By 


Proposition 4.5, we have 


_ loglyely — |, logmax(|Px(1,0)|,,|Q«(, 0)|») 
lim ——— = lim . 
k—oo d k—oo dk 


Combining this equality with (5.20) and Proposition 3 yields (5.19). oO 
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We are now ready to prove the results regarding the computation of the canonical 
height h,(f). First, we'll need a lemma. Note that the lemma does not follow 
directly from the work of Call and Goldstine [CG97], since they only prove that in 
a fixed number field, the local canonical heights sum to the global canonical height. 
What is required here is slightly different. 


Lemma 5.8. Let B = [a: b]in P\(K). Let [a, : bi],..., [an : by] be the conjugates 
of [a : b] under the action of Gal(K / K). Then 


[K(B) : K](deg Kyho((a : 6) 
= im So onmaxdl Palas) 1e(@i- bd) 


k—>oo * dk 
places v of K i=l 


(5.21) 


Proof. For all but finitely many v, we have |a;|, = |b;|, = 1. Furthermore, for all 
but finitely many v, we have 


log max(| Pe (s, 1)|v.|Qx(s. Dh») = 0 (5.22) 


for all k whenever |s|, = |t|, = 1. This is true, for example, at all nonarchimedean 
v of good reduction for g in the sense of [PST04]. Indeed, when v is a finite place, 
(5.22) will hold for all |s|,, = |t], = 1 unless either |Res(P(To, 1), Q(To, 1))|y or 
|Res(P(1, 7;), O(1, T)))|, is less than 1, where Res is the usual resultant of two 
polynomials (see [BK86, p. 279, Proposition 4]). Thus, we can interchange the limit 
and the sum on the right-hand side of (5.21) so that 

. “. log max(|Px(a;,b;)|y. aj, bi)» 

ie > 8 (| Px ( “i |Ox( Mv) 


k-oo ‘ 
places vof K i=1 


n 


_ ; log max(| Px (ai, bj) |v, |Ox (ai, bi) |v) 
= ys Jim DU dk ; 


(5.23) 


places v of K i=l 


Now let L be the field K(f) and let w be a place of L that extends the place v of K; 
we write w | v. The field L has n embeddingsi : L <> C,; for exactly [L,, : Ky] 
of these embeddings, we have |i(x)|, = |x|» for all x € L. This yields [Ly : Ky] 
conjugates [a’ : b’] of [a : b] such that |a|,, = |a’|, and |b|,, = |b’|,,. Hence, 
we see that 


Y > log max(| Px (ai, bi)\v, |Ox (ai. bi) |v) 


i=l 


= [Ly : Ky]log max(| Px (a, b)|w. |Ox(a. B)|v)- 


wiv 
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Thus, we have 


= Slog max(| P; (ai. bi)|v, |Qk (ai. bi) |v) 


places vof K i=1 


= [K(B) : K](deg K)h(g* (a : b))), 


by (1.0.6). It follows from (1.0.7) and (4.8.3) that we therefore have 


: - log max(| Px (ai, bi) |v, |Qx (ai. bi) |v) 
> Jim) dk 


places v of K i=l 


K (lag: 
= [K(B) : Ky(deg ) im “PE? *D) 


= [K(B) : K](deg K)hg([a : 5). o 


Theorem 5.9. Let a be any point in P!(K) that is not an exceptional point of 9. 
Then, for any B € K and any nonzero irreducible F € K[t] such that F(B) = 0, 
we have 


(deg K)(deg F)(hg(B) — hg(oo)) 


= > jim = Jy) log |FW)h 


places v of K o* ((wilf)=a 
F(w)40 


where the [w : 1] for which g* ([w : 1]) = @ are counted with multiplicity. 


Proof. Write F(t) = yJ[]j=,(@ — Bi) where y € K and the f; are the 


conjugates of 6 under the action of Gal(K/K). By the product formula, we have 
ae vor x log |yl» = 0. Thus, using Theorem 4.6 and Proposition 2.1, we see that 


~ tmz YL bos FO 


places v of K g* (bw:lf)=a 
F(w)40 
1 n 
= E inge E lfloao 
places v of K ¢* (w:l))=a i=l v 
F(w)40 

fe 2 log max(| Px (B;, 1)|), |Ox(Bi, 1) |v) 

= lim 
k->0o dk 


log max(| Px (1, 0) |v, |Qx« C1, 0)l») 
d* ; 


—(deg F) lim (5.24) 
k—-oo 
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By Lemma 4.8, the quantity on the last two lines is equal to 

(deg F) (deg K)(hy(B) — hg(oo)), 
as desired. Oo 
Theorem 5.10. For any B € K and any nonzero irreducible F € K{t] such that 


F(B) = 0, we have 


(deg K) (deg F)(hg(B) — hg(oo)) 


= > Jim = log | F(w)|,. 


places v of K ¢* (wil) =[w:1] 
F(w)40 


where the [w : 1] for which g* ([w : 1]) = w are counted with multiplicity. 


Proof. The proof is the same as the proof of Theorem 4.9, using Theorem 5.7 in 
place of Theorem 5.6. oO 


6 A counterexample 


The main theorems of this paper are not true when we work over the complex 
numbers C rather than K. Let K = Q and let y([x : y]) = [x? : y?] be the 
usual squaring map. Let v be the archimedean place of Q, so that C, is just the usual 
complex numbers C. We define the function y on the positive integers recursively 
by w(1) = 2 and W(n) = 2@¥@-))_ Leta = 0°, 1/w) and let B = e?7?¢, 
Note that for any f, we have |e?”'’ — 1| < (t —[t]) (where [f] is the greatest integer 
less than or equal to ¢). Letting ¢,, = log, y(n), we then have 


1 log |BY™ —1| 
yews, 
Qn REI P| w(n) 


w2'n =] 


= 


1 
Toa 8" onda — [(n)a)) 


1 vin) G1 
: y(n) ia Win +1) dX Div+1) 


<logxm + 1—nlog2 + log2. 
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1 
Thus, 7, Do ,,2% —, log |B — w|y goes to —oo as n > 00, so 


I 
Jim = Dy loglw—Bh 


wrk =] 


does not exist. 


7 Applications and further questions 
7.1 Lyapunov exponents 


The Lyapunov exponent L(@) of a rational map ¢ : Pe —_— Pt (see [Man88]) can 
be defined as follows. Choosing coordinates [To : T;] for P{, letting ¢ = To/T, and 
writing g(t) = P(t)/ Q(t) for polynomials P and Q, we define 


L@ = i log|o' (dtp. 
PI(C) 


where jl, is the unique measure of maximal entropy measure for g on P!; this 
measure of maximal entropy is the same as the Brolin—Lyubich measure discussed 
in Section 3 (see [Mang3]). 

The Lyapunov exponent can be computed via equidistribution on certain sub- 
sequences of inverse images of nonexceptional points in P!(C) (see [DeMO03], 
[Mafi88]). That is, given a nonexceptional point a in P!(C), there is an infinite 
strictly increasing sequence of integers (7m; )?2, such that 


1 
L(y) = lim ———_ log |y'(B)|. 
i>oo (deg gy)" oe, 


' (B)#0 
B00 


It is not known, however, whether L(y) can be computed by taking the limit of the 
average g’ on the periodic points of g. 

When ¢ is defined over a number field K, however, we obtain the following result 
as a corollary of 5.17 Theorem. 


Corollary 7.1. Let K be a number field and let 9 : Pe — Pe be a nonconstant 
rational map that is defined via base extension from a map ¢ : Pe — Pe. Let 
be defined as above. Then 


1 
L(y) = lim ——— log |y'(E)|.- 
Oe aaa: x og |y’(&)| 
g* (E)=E 
gy’ (é)#0 


E#o0 
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Proof. We may write gy’ as a quotient of polynomials A(t)/B(t) with coefficients 
in K. This yields log |g’(t)| = log|A(t)| — log|B(t)|. The corollary then follows 
immediately from Theorem 5.7. oO 


This corollary says that if g is a rational function defined over a number field, 
then the Lyapunov exponent of g is completely determined by the derivative of @ at 
the periodic points of g. This means that the derivative of g at the periodic points of 
gy also determines the Hausdorff dimension of the Julia set (see [FLM83]). 


7.2 Computing with points of small height 


The results in [Bil97], [AutO1], [BRO6], [FRLO4], [FRLO6], and [CL06] all apply 
not only to the periodic points and backwards iterates of a point that we treat in this 
paper but to all points of small height in the algebraic closure of a number field K. 
For example, one of the main theorems in [BR06], [FRL04], [FRLO6], and [CL06] 
states that for any continuous function g on P!(C,) and any infinite nonrepeating 
sequence of points (q,,) in P! (K) such that limy—+oo Mg(a@,) = 0, one has 


1 
ig a 
neo |Gal(@i,)| d, slay) [ 


gdpyo, (7.1) 
a €Gal(a, ) PCy) 
where Gal(q,,) is the Galois group of the Galois closure of K(a,,) over K. 

Baker, Ih, and Rumely [BIRO8] and Autissier ([Aut06]) have produced coun- 
terexamples that show that (6.1.1) does not always hold when the function g is 
replaced with log|F'|, for F a polynomial. All of these examples involve infinite 
nonrepeating sequences of points (q@,,) € ©) such that limy,—+o99 h(a) = 0 and 


1 


lim ——— 
noo |Gal (a )| 


1 
log loz —21 4 [ log |e7"!° — 2\d6. 
0 


o€Gal(a@, ) 


The points (@,,) are not preperiodic in any of these examples. Thus, it may be 
possible to prove that the main results of this paper continue to hold when we work 
with any nonrepeating sequence of Galois orbits of preperiodic points. This would 
imply the following conjectured generalization of Siegel’s theorem for integral 
points. 


Conjection 7.2 (Ih). For any nonpreperiodic point B € Pee (K), there are at most 
finitely many preperiodic points of ~ in Poe (K) that are integral relative to B. (Here, 
0x is the ring of integers of K and a is said to be integral relative to B if the Zariski 
closure of a does not meet the Zariski closure of B in Pie 
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Baker, Ih, and Rumely have proven that this is true when ¢ is a Lattés map or the 
usual squaring map x +> x? . Using Theorem 5.10 and arguing as in [BIRO8] (or 
as in [Sil93], which presents a related result), it is possible to derive the following 
weak version of Ih’s conjecture in general. 


Proposition 7.3. For any nonpreperiodic point B € P!(K), there are at most 
finitely many n such that all a € P'(K) of period n are B-integral. 
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Abstract We give some properties of admissible smooth representations of a 
reductive p-adic group G over a field of characteristic p, deduced from Pontryagin 
duality, Ollivier’s equivalence of categories when G = PGL(2, Q,,), and finiteness 
properties of the Hecke ring of a pro- p-Iwahori-Hecke algebra. 
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1 Introduction 


La théorie des représentations lisses des groupes réductifs p-adiques sur un corps de 
caractéristique p en est a ses débuts, et un expert des représentations sur un corps de 
caractéristique différente de p est désemparé sans mesure de Haar. Nous avons réuni 
ici quelques propriétés de l’admissibilité, qui sont des applications assez faciles de 
trois théories: la dualité de Pontryagin, bien connue des arithméticiens p-adiques, 
P’équivalence de catégories de Ollivier pour GL(2,Q,,)/ p®, et les propriétés de 
finitude des Z-algébres de Hecke d’un pro- p-sous-groupe d’Iwahori. 
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Soit ZL un corps commutatif localement compact de caractéristique 0, d’anneau 
des entiers Oz, d’uniformisante p,, de corps résiduel kz, de caractéristique p. 
Posons A := L, O; ouky. 

Soit G un groupe analytique p-adique, ou ce qui est équivalent: G contient un 
pro- p-sous-groupe ouvert isomorphe a un sous-groupe fermé de GLa (Z,) for some 
d > 1 [4] 8.33 page 201 et Interlude A (0),(n) page 97. 

Une A-représentation V de G est un A-module topologique sur lequel G opére 
contintiiment. Selon que A = L,Oz,kz, on dit que V est p-adique, p-adique 
entiére, modulo p. 

Pour toute partie H de G, on note V” le A-sous-module des v € V fixes par 
chaque élément de H. Lorsque V = UyV™” ot H parcourt les sous-groupes 
ouverts compacts de G, on dit que V est lisse. 

Une A-représentation lisse V telle que V” est un A-module de type fini pour 
tout sous-groupe ouvert compact H de G, est appelée admissible. 

Si V est un O; -module discret de torsion, l’action continue de G sur V est lisse; 
on dit que V est une représentation p-adique de torsion de G. 

Soit 

0- Vi ~ vn > V3 + 0 


une suite exacte de A-représentations lisses de G. Si V3 est admissible, alors V; est 
admissible, et si V; et V3 sont admissibles, alors V2 est admissible, car |’anneau A 
est noetherien. 

Si A = L, l’existence d’une mesure de Haar sur G a valeurs dans L implique 
l’exactitude du foncteur V > V%, et la stabilité de l’admissibilité par quotient. 
Pour A = O,,k_, le foncteur V > V” n’est pas exact a droite et l’on se demande 
si la stabilité par quotient de I’ admissibilité reste vraie. 

Nous allons montrer que la réponse est oui si V est une représentation 
p-adique entiére annulée par une puissance de p, en particulier si V est une 
représentation modulo p, en utilisant la dualité de Pontryagin et le lemme de 
Nakayama topologique [1]. 

Soit U un pro-p-sous-groupe ouvert fixé de G et soit V une représentation 
p-adique entiére de G telle que p‘V = 0 pour un entier k > 1. Le dual de 
Pontryagin VY est toujours annulé par p*, et par restriction 4 U c’est un module 
profini sur la Oy, -algébre d’Iwasawa de U. La dualité de Pontryagin échange les 
U-invariants V" et les U-coinvariants (VY)y et respecte la propriété d’étre fini. 
En appliquant le lemme de Nakayama topologique, on obtient le théoréme 4 qui 
implique les deux théorémes suivants. 


Théoréme 1. V est admissible si et seulement si V" est fini. 


On comparera avec le résultat de Paskunas [6]: toute représentation lisse V sur 
un corps k de caractérisque p d’un groupe topologique G contenant un pro- p-sous- 
groupe ouvert U telle que dim, VY < oo est admissible. La démonstration basée 
sur les enveloppes injectives, n’utilise ni la dualité de Pontryagin ni le lemme de 
Nakayama. 


Théoréme 2. Si V est admissible, tout sous-quotient de V est admissible. 
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Un groupe réductif p-adique est le groupe des points rationnels d’un groupe 
réductif connexe sur une extension finie de Q,,; c’est un groupe analytique p-adique. 
Le groupe des points rationnels d’un groupe réductif connexe sur corps local de 
caractéristique p n’est pas un groupe analytique p-adique. 


Corollaire 1. Si V est une représentation admissible d’un sous-groupe de Levi 
d’un sous-groupe parabolique P d’un groupe réductif p-adique G, alors tout sous- 
quotient de l’induite parabolique indg V est admissible. 


En effet [9] I.V.6, l’induction parabolique respecte l’admissibilité puisque G/P 
est compact par la décomposition d’Iwasawa, la propriété d’étre entiére et annulée 
par p*. 

La seule représentation irréductible de U sur ky, est isomorphe a la représentation 
triviale de dimension 1, et le foncteur d’induction compacte ind est exact. Cette 
remarque et le théoréme 2 impliquent 


Corollaire 2. Une représentation de type fini V de G sur ky, admet une filtration 
G-équivariante finie de quotients cycliques engendrés par un vecteur U -invariant. 
Si V est admissible, les quotients sont aussi admissibles. 


Dans le cas trés particulier ob G = GL(2,Q,)/p%, l’on sait beaucoup plus. La 
raison est la suivante: 

Soit J; le sous-groupe des matrices de GL(2,Z,) congrues au sous-groupe 
strictement triangulaire supérieur de GL(2, F,) modulo p, appelé un p-sous-groupe 
d’Iwahori de GL(2, Q,,), et qui tient lieu de p-sous-groupe de Sylow. On I’identifie 
a son image (isomorphe) dans G. Sa Z-algébre de Hecke Hz(GL(2, Z,), J) est un 
module de type fini sur un anneau commutatif de type fini ([10] th.3), c’est donc 
une algébre noetherienne. Soit F, une cléture algébrique de k,. Par le théoréme 
d’équivalence de Ollivier [7], la catégorie des modules a droite de AR, (G, [,) est 


équivalente a celle des représentations lisses de G sur Fy qui sont engendrées par 
leur vecteurs invariants par J; [7]. 

En utilisant la filtration (corollaire 2) nous obtenons le résultat (bien connu pour 
les représentations complexes ou modulo ¢ d’un groupe réductif p-adique): 


Théoréme 3. La catégorie des F ,-représentations de GL(2,Q,)/ p” est noetheri- 
enne, et une F p-représentation de GL(2, Q,)/p” est admissible de type fini si et 
seulement si elle est de longueur finie. 


On peut remplacer les données F, ou GL(2,Q,)/p” du théoréme par k;, ou 
GL(2,Q,), si le théoréme d’équivalence de Ollivier est valable avec ces nouvelles 
données. 

On ne sait pas si une représentation irréductible V d’un groupe réductif p-adique 
G sur un corps algébriquement clos C de caractéristique p, est admissible ou a un 
caractére central (c’ est vrai si la caractéristique de C est différente de p). On sait que 
V irréductible a un caractére central si C est non dénombrable (preuve classique), ou 
si V est admissible (V/! de dimension finie contient une droite stable par le centre), 
ou si G est déployé et le Ay, (G, [;)-module V" contient un module simple M (car 
M aun caracteére central (fl 1] 5.3 plus [10])). 
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2 Dualité de Pontryagin 


On note Modi, la catégorie abélienne des O,-modules discrets de torsion et 
des applications O;-linéaires, et Modpror la catégorie abélienne des O,-modules 
profinis et des applications O,-linéaires continues. 

Tout V € Modo; est la limite inductive de ses O,-sous-modules finis W, muni 
de la topologie de la limite inductive [2] AII.93; les applications de transition étant 
les inclusions. Tout M € Modpror est la limite projective de ses quotients M/N par 
les O,-sous-modules ouverts N, muni de la topologie de la limite projective; les 
applications de transition sont les surjections. 

Une application linéaire continue dans Modpro¢ est fermée car un O,-module 
profini est compact et séparé [3] 1.63. 

La dualité de Pontryagin 


EY = Homo, (£, L/ Oz) 


pour E = V ou M, satisfait (EY)Y = E, elle échange limites inductives 
et projectives (topologies comprises), L/Oz, et Oz, et induit une équivalence 
contravariante entre Modjo, et Modpror. 

On considére les sous-catégories abéliennes Modior(G) C Modjo, et Modprot 
(G) C Mody o¢ des Oz-modules discrets de torsion ou profinis munis d’une 
application continue de G. Le groupe G agit continuement sur E = V ou M si 
lV application 

(g,x) > gx:GxEOE 


est continue comme fonction des deux variables. Alors V € Modo;(G) est la limite 
inductive de ses Oz, -sous-modules finis W stables par G, et M € Modpror(G) est la 
limite projective de ses quotients par les O, -sous-modules ouverts N stables par G. 
L’action contragrédiente du groupe G sur le dual de Pontryagin EY est définie par 


(gx*, gx) = (x*,x) (g €G,x€ E,x* € E*). 


La dualité de Pontryagin induit une équivalence contravariante entre Modto,(G) et 
Modprot(G). 

Soit H un groupe profini topologiquement de type fini (par exemple un sous- 
groupe fermé de G [4] prop. 3.11, page 51). Soit E = V € Modio(H) ou EF = 
M € Modprop(7) . 

Le submodule E” C E des H-invariants de E est fermé car l’action de H est 
continue. Le module Ey des H-coinvariants de E est E/E(H) ot E(#) est le 
O,-sous-module de EF engendré par 


gx—x (geH,xe BE). 


Proposition 1. (/) E(#) est fermé dans E. 
(2) La dualité de Pontryagin échange invariants et coinvariants 


(E")\’ =(EY)y, et (E/E")Y = EY(H). 
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Preuve. (1) Tout sous-module de E = V est fermé (la topologie est discréte) et le 
probleme ne se pose que pour E = M. Soit Xy un ensemble fini engendrant 
un sous-groupe dense H’ de H. Le O,-sous-module M’ de M engendré par 
(h—1)M pourh € Xy est fermé car M est compact. Ona M(H’) = M' car 
légalité 


hy...h;m—m=hyho...h-m—h2...hym+ho...h,m—m 


pour /1,...,h, € Xy etm € M, montre par induction sur r que h,...h,m — 
m € M’. Lacontinuité de l’action de H implique que M(H’) = M(A1). Donc 
M(A) = M’ est fermé dans M. 

(2) f € EY s’annule sur E(#H) si et seulement si f est invariant par H, 


Gy? = Ga). 


Remplacons E par EY, puis prenons le dual de Pontryagin et l’on obtient 
(E")Y = (EY). Ceci implique (E/E”%)Y = EY(H) car l’on a la suite 


eee 1 > (E/EX)Y — EY > (E#)Y 50. 


3 Le lemme de Nakayama 


Module signifiera module a gauche. Si A est un anneau et N un A-module, on note 
d4(N) € NU c le nombre minimal de générateurs du A-module NV. 


Proposition 2. [1] §3 (Lemme de Nakayama) 
Soient A un anneau topologique compact, N un A-module profini et I un idéal 
bilatére topologiquement nilpotent de A. Alors 


da(N) = daj(N/1). 


Soit U un pro-p-sous-groupe de rang fini (par exemple un pro-p-sous-groupe 
fermé de G). Un O,-module profini muni d’une action continue de U s’identifie a 
un module profini sur la O, -algébre d’ Iwasawa 


A:= O1 [UI] = lim O,(U/U" 
U’ 

ou U’ parcourt les sous-groupes distingués d’indice fini de U. Comme U est de 
rang fini, un sous-groupe est ouvert si et seulement s’il est d’indice fini, l’anneau 
compact A est local, sa topologie est celle donnée par les puissances de son 
radical, et A est noetherien 4 gauche et 4 droite [8] §2.2.7. Tout idéal propre de A 
est topologiquement nilpotent. Le noyau Jy de ’homomorphisme d’ augmentation 
A — O, est l’idéal 4 gauche engendré par 


u—1 (ue Xy) 


ou Xy est un sous-ensemble fini de U engendrant topologiquement U. Le radical 
de A est l’idéal engendré par Jy et pz [5] 2.1.8. 
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Proposition 3. Soit N € Modpro¢(U). Pour tout sous-groupe distingué d’indice fini 
U' CU,ona 
da(N) = do,usu(Nu’). 


En particulier d,(N) = do, (Nv). 


Preuve. Soit Iy;y’ le noyau du morphisme canonique surjectif A > O,[U/U". 
Comme idéal a gauche, Jy est engendré par u — | pour u dans un sous-ensemble 
Xv fini de U’ engendrant topologiquement U’. L’espace des U’-coinvariants de 
N est 

Nur = N/IujuN. 


On applique le lemme de Nakayama a (A, N, [u/u’). 
Corollaire 3. Les propriétés suivantes sont équivalentes: 


i. N est un A-module de type fini, 
ii. Ny est un O,-module de type fini pour un sous-groupe d’indice fini H de U. 
iii. Ny est un O,-module de type fini pour tout sous-groupe d’indice fini H de U. 


Preuve. Un sous-groupe d’indice fini H de U contient un sous-groupe distingué U’ 
d’indice fini de U. La proposition 3 implique que N est un A-module de type fini 
si et seulement si Ny est un O,-module de type fini. I] reste 4 montrer que Ny est 
un O,-module de type fini si et seulement si Ny est un O,-module de type fini. 
La O,-algébre d’Iwasawa de H est un module libre de type fini sur celle de U’, 
engendrée par les images d’un systéme de représentants de H/U’. Donc 


do,gay(N) < do,quy(N) S [A : Udo, ay). 


On applique le cas particulier de la proposition 3 a U’ et a H et le corollaire est 
démontré. 


Corollaire 4. Soit U' un sous-groupe ouvert distingué de U. Alors 


do, (Nu) < do,(Nou’) < [U : U'| do, (Nu). 


4 Application a l’admissibilité 


Soit V une O,-représentation lisse de G annulée par p* pour un entier k > 1. Soit 
U un pro- p-sous-groupe ouvert de G. 


Théoréme 4. (1) Les propriétés suivantes sont équivalentes: 


i. V est admissible. 
ii. Le dual de Pontryagin V restreint 4 H est un O,|[H]]-module de type fini, 
pour tout sous-groupe ouvert compact H de G. 
iii. Le dual de Pontryagin V restreint 4 U est un O,[[U]]-module de type fini. 
iv. VY est fini. 
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(2) SiU’ est un sous-groupe d’indice fini distingué dans U, alors 


do, (V") < do, (V" ) <[U : Udo, V™). 


Preuve. La dualité de Pontryagin respecte la propriété d’étre fini (et non d’étre un 
O,-module de type fini), d’étre annulé par p* (et non d’étre de torsion), et échange 
U-invariants et U -coinvariants (proposition 1). Un O,-module de torsion est de type 
fini si et seulement s’il est fini. On en déduit que V est admissible si et seulement 
si (VY) est un O, -module de type fini, pour tout sous-groupe ouvert compact H 
de G. On applique alors les corollaires 3, 4. 


Nous démontrons le théor’me 2. Un quotient W de V est annulé par p*. 
La dualité de Pontryagin est contravariante donc WY est un sous-module de VY. 
L’anneau O;[[U]] est noetherien donc si VY est un O; [[U]]-module de type fini, il 
en est de méme du O;[[U]]-sous-module WY. Si V est admissible, W l’est aussi 
par le théoréme 4. 


5 Représentations de type fini 


Soit U un pro- p-sous-groupe ouvert de G et soit V une représentation de type fini 
de G sur ky. Nous allons montrer que V admet une filtration G-équivariante finie 
de quotients de type fini engendrés par leur U -invariants. 

Soient vj,..., ¥- un systéme fini de générateurs de V, et soit W la représentation 
de U engendrée par ces éléments dans V. Comme V est lisse, dim, W est fini. 
La représentation V est quotient de la représentation induite compacte indG (W). 
La seule représentation irréductible de U sur kz, est isomorphe 4 la représentation 
triviale de dimension 1, et le foncteur d’induction compacte ind est exact. La 
représentation finie W de U sur ky a une filtration U-equivariante de longueur 
r = dim, W, 


OCWCc-::-CW,=W avec dim W;,/Wi-) = 1, 
induisant une filtration G-equivariante, 
0 c ind (Wi) C «++ C ind§ (W,) = inde (W) 
de quotients isomorphes 
ind® (W;)/ ind® (W;_-) ~ ind& (1x, ) 
pour tout 1 < i < r. Limage de cette filtration par l’application surjective 


canonique ind& (W) — V est une filtration G-equivariante de V, 


OCYc:CV=V 
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de quotients isomorphes a des quotients (éventuellement nuls) du “module uni- 
versel” ind@ (1x,). Si de plus V est admissible, les quotients sont admissibles 
(théoréme 2). Le corollaire 2 est démontré. 

Nous remarquons que si V’ est une sous-représentation de G, la filtration induite 
par celle de V sur V’, 


(CV CHC V HV Ver nV, 
a ses quotients isomorphes a des sous-quotients du module universel ind (1x,). On 


déduit de la démonstration du corollaire 2 et de cette remarque: 


Proposition 4. Si tout sous-quotient de ind® (1x, ) est de type fini, alors la catégorie 
des représentations de G sur k 1, est noetherienne. 

Si tout quotient admissible de indé (1x,) est de longeur finie, alors toute 
représentation admissible de type fini de G sur kz, est de longeur finie. 


Ces propriétés sont vérifiées pour G = GL(2,Q,)/p%, par l’équi-valence de 
catégories [7] et la noetheriannité de H;, (G, Z,) [10]. Le théoréme 3 est démontré. 
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Abstract This paper states a conjecture for Nevanlinna theory or diophantine 
approximation, with a sheaf of ideals in place of the normal crossings divisor. This 
is done by using a correction term involving a multiplier ideal sheaf. This new 
conjecture trivially implies earlier conjectures in Nevanlinna theory or diophantine 
approximation, and in fact is equivalent to these conjectures. Although it does 
not provide anything new, it may be a more convenient formulation for some 
applications. 
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in the number field case, and a similar inequality is conjectured for Nevanlinna 
theory. Here a is the ideal sheaf associated to D and .%~ (a) is a type of multiplier 
ideal sheaf. For other notations, see Sections | and 2 and Conjectures 4.1 and 4.2. 

Dropping the condition on D is possible here because of the additional term 
—ms(-4%~ (a), P) on the left-hand side of the inequality. 

This may be a more convenient formulation for some applications. Also, this 
conjecture shows how multiplier ideal sheaves may have a role in Nevanlinna theory 
and diophantine approximation, and therefore may give more information on the 
structure of the situation. 

Section | briefly describes multiplier ideal sheaves, and gives a variant definition 
specific to this situation. It also includes related definitions. Section 2 recalls the 
standard definitions in Nevanlinna theory, as well as their counterparts in number 
theory, that are needed in this paper. Section 3 describes proximity functions for 
sheaves of ideals, using work of Silverman and Yamanoi. Sections 4 and 5 form the 
heart of the paper, giving the conjectures and showing their equivalence to previous 
conjectures. 

Throughout this paper, X is a smooth complete variety over C (in the case 
of Nevanlinna theory) or over a global field of characteristic zero (in the case 
of Diophantine approximation). For the purposes of this paper, a global field is a 
number field or a function field of dimension one. 


Acknowledgements Supported by NSF grants DMS-0200892 and DMS-0500512. 


1 Multiplier ideal sheaves 


Definition 1.1. Let f: X -—> Y be a morphism of smooth varieties. Then the 
relative canonical divisor Ky ;y of X over Y is defined to be the divisor class 


Kyjy = Ky — f*Ky, 


where Ky and Ky are the canonical divisor classes on X and Y, respectively. As 
Lazarsfeld notes [3, below (9.1)], if f is a proper birational morphism then Ky y 
is represented by a unique effective divisor supported on the exceptional locus of 
Jf; a local equation is given by the determinant of the derivative, det(df). In other 
words, this is the ramification divisor. 


Definition 1.2. A Weil divisor D on X is reduced if it is effective and all prime 
divisors occurring in it have multiplicity 1. If D = }~n; D; is a Weil divisor, written 
such that the D; are distinct prime divisors and n; 4 0 for alli, then D,.g denotes 
the divisor )> Dj. 


Definition 1.3. A subset of X has normal crossings if it is defined in a neigh- 
borhood of any point by an equation in local analytic coordinates of the form 
Z1+++Z- = 0 (and therefore is of pure codimension 1). A divisor D on X is anormal 
crossings divisor if it is reduced and if its support has normal crossings. 


Multiplier ideal sheaves, Nevanlinna theory, and Diophantine approximation 649 


Definition 1.4. Let D = )°n; Dj; be a Weil Q-divisor, where n; € Q for all i and 
the D; are distinct prime Weil divisors. Then the floor of D is the divisor 


[D] = oli]; . 


Note that this definition does not respect linear or numerical equivalence. 


Definition 1.5. Let a be a nonzero sheaf of ideals on X, and let c € Rso. Let 
ju: X' — X bea proper birational morphism such that X’ is a smooth variety and 


p*(a) = Ox(-F) 


for a divisor F on X’ with normal crossings support. Then the multiplier ideal sheaf 
associated to a and c is the ideal sheaf 


JF (a°) = bx Ox (Kxrjx = LcF J). 


By a theorem of Esnault and Viehweg [3, Thm. 9.2.18], this definition is 
independent of the choice of ju. 
For our purposes we need a slightly different definition. 


Definition 1.6. Let a and c be as above. We then define 
JF (a°) = lim (a). 
e>0t 


Here we use the discrete topology on the set of ideal sheaves on X, and note that 
the limit exists because there are only finitely many coefficients in |(c — €) F |. 


We also write .%(a) = .%(a') and .4%~(a) = .¥~(a!). 


Example 1.7. Let D be a normal crossings divisor on X and leta = @(—D). Then 
we can take X’ = X, in which case F = D and 


Kyjx = (Ud —©)F] =0, 


so 4% (a) = Cy (the ideal sheaf corresponding to the empty closed subscheme). 
More generally, if D is effective and has normal crossings support but is not 
necessarily reduced, then |(1 — €)F | = D — Drea, and therefore 


I” (O(—D)) = O(-(D — Drea)). 


2 Definitions from Nevanlinna theory 


This section gives, for the convenience of the reader, various standard definitions 
from Nevanlinna theory, as well as their counterparts in number theory. 
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We start with the standard definitions of Nevanlinna theory. First, a Weil function 
for a divisor D on X is a continuous function Ap: X \ Supp D — R such that if D 
is locally represented by a principal divisor (f) on an open set U, then Ap + log | f | 
extends to a continuous function on all of U. 

For a holomorphic curve f:C — X whose image does not lie in Supp D, the 
proximity function for f relative to D is then defined as 

2n 


.,. ad 
m(D,r) = : An( fre!) —— 


for all r > 0. This depends on the choice of Weil function A p for D, but this affects 
the proximity function only by a bounded amount (depending only on the two Weil 
functions). 

To define the counting function of f relative to D, we first let ord, A denote the 
multiplicity of an analytic divisor A on C at a point z. The counting function for f 
with respect to D is then 

N;(D,r) = > (ord, f* D) log a 


0<|z|<r 


| + (ordo f* D) logr, (2.1) 
z 


and the characteristic (or height) function is defined to be 
Tp. ¢(r) = mz(D,r) + Ne (D,r). 


It is well known that the latter depends up to O(1) only on the linear equivalence 
class of D, so it is possible to define the characteristic function T/p, ¢ (7) for a divisor 
class D (or for a line sheaf). 


Definition 2.2. A divisor D on X is big if there is a constant c > 0 such that 
h®(X,nD) > cn*™* for all sufficiently large and divisible integers 7. 


It is known, and easy to check, that the proximity, counting, and height functions 
are linear in D. Moreover, if D is a big divisor and f has Zariski-dense image, then 
the height is as big as possible, up to a multiplicative constant. If D is big and D’ 
is another divisor, then Tp, f(r) < c Tp,¢(r) + O(1) for some constant c. Thus, 
heights relative to big divisors occur frequently in error terms. For details on these 
assertions, see [7, § 11]. 

A slightly more general situation involves finite ramified coverings. These 
correspond to considering algebraic points instead of rational points in number 
theory. The general setup is that B is a connected Riemann surface, p:B — C 
is a proper surjective holomorphic map, and f: B — X is holomorphic. 

f 
B— Xx 


lp (2.3) 
C 
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For real r > 0 let 
Bir) ={be B: |p(b)| =r} 
and Bir) = {b € B: |p(b)| <r}, 


1 ef de 
— deg p? 20 
on Bir). 


Then the proximity function can be defined in this context as 


and let o be the measure 


m(D.r) = | Aaof-o, 
Bir) 


and the counting function can be defined as 


N;(D.1r) = Nexo), 


Na(r) = —Z > (ord, A) log — + ye (ord, A) logr 
Y be B(r)\p—!(0) Ip) bep—!(0) 

(2.4) 
for an analytic divisor A on B. We also define the ramification counting function 
for p to be 

Nram(p) (71) = Nr(r) , 


where R is the ramification divisor of p. 

For more details on these definitions, see [7, § 26]. 

The corresponding definitions in the number field (or global field) case are as 
follows. 

Let k be a global field, and let M;, denote its set of places. If k is a number field, 
then M;, is the disjoint union of a set Soo, which is in one-to-one correspondence 
with the set of embeddings 0: k — C modulo complex conjugation, and a set which 
is in one-to-one correspondence with the set of nonzero prime ideals of the ring of 
integers ©; of k. Norms || - ||, associated to places v € M; are normalized so that 
if v € Soo, then ||x||, = |o(x)|"™l, where v corresponds to o:k < C and k, is 
the local field at v; and if v ¢ Soo, then ||z||, = (@ : p)~!, where v corresponds 
top C G, and x € p \ p’. For function fields, the set of places and their associated 
norms is assumed to be given with the function field, and Soo = @. For either type 
of field, the product formula 

T] tsb =1 


veM, 


holds for all x € &* (by assumption, in the case of function fields). 
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Let D be a divisor on X. A Weil function Ap is a real-valued function on 
vem, (X \ Supp D)(k,); its restriction to (X \ Supp D)(k,) is denoted A py. Each 
A py is similar to a Weil function in Nevanlinna theory, normalized like — log || f ||, 
but there are additional conditions as v varies, as well as extra considerations due to 
the fact that k, is not locally compact. The details are too extensive to give here; see 
[2, Ch. 10]. Another (incomplete) reference is [7, § 8]. 

Now let P € X(k) be a point not lying on the support of D, and let S D> Soo be 
a finite set of places of k. Then the proximity function for P relative to D (and S) is 
defined as 


ms(D, P) =) Apw(P), 


ves 


the counting function is defined as 


Ns(D, P) = ))Aps(P), 
véS 


and the height is defined as 


hpx(P) = ms(D, P) + Ns(D, P) = > Av (P). 
ve M, 


These are linear in D, and again the height depends up to O(1) only on the linear 
equivalence class of D. These definitions depend on the choice of Weil function 
Ap, but only up to a constant depending only on the Weil functions involved. The 
definitions also depend on k, but k may be omitted from the notation since it is 
usually fixed. 

As in the case of Nevanlinna theory, if 3’ C X(k) is a set for which all infinite 
subsets are Zariski-dense, if D is a big divisor, and if D’ is any other divisor, 
then hp ,.(P) < chp,x(P) + O(1) for some constant c. For more details, see 
[7, Sects. 9-10]. 7 

These definitions generalize to points P € X(k) as follows. Assume P ¢ 
Supp D, and let L be a finite extension of & containing k(P). For places w € Mz, 
we write w | S if w lies over a place v € S, and w + S otherwise. Noting that 
Ly = k, if w | v, we write Ap, = [Ly : ky]Ap,. We then define the proximity, 
counting, and height functions as 


ms(D, P) = a rs dae), 
weEM, 
w|S 


Ns(D, P) = eg 2 AopplP); 


lem: 
wtS 
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and 


1 
hoa (P) = ms(D, P) + Ns(D, P) = Fe >> Ane). 


weM, 


These expressions do not depend on the choice of L. 
Corresponding to Nram(p) (7) in Nevanlinna theory, we consider the discriminant 
of k(P) for an algebraic point: 


log |Dicp)| 
[k(P) : k] 


where D; denotes the discriminant of the number field k. The corresponding 
definition in the function field case involves the ramification divisor of the morphism 
of curves corresponding to the field inclusion k <> L,; it is left to the reader as an 
exercise. For more details on discriminants, see [7, § 23]. 


dy (P) = — log |Dx\, 


3 Proximity functions for ideal sheaves 


Silverman [5, 2.2] introduced Weil functions associated to sheaves of ideals on X. 
By [5, Thm. 2.1], there is a unique way to associate to each ideal sheaf a 4 (0) of 
X a Weil-like function A, on X \ Y, where Y is the closed subscheme associated to 
a, such that A, = Ap is a Weil function in the usual sense if a = @(—D) for some 
effective Cartier divisor D, and Ag45 = min{A,q, Ap} for all nonzero ideal sheaves 
a and b of X. Here uniqueness and equality are up to addition of functions bounded 
by M,-constants. An M;-constant is a function v } c, from M; to R such that 
cy = 0 for all but finitely many v. 
Weil functions of ideal sheaves also satisfy the following conditions: 


(3.1) They are functorial in the sense that if f: X’ — X is a morphism of complete 
varieties with f(X’) ZY, thenA seq =Aqo f. 
(3.2) Ifa C b are ideal sheaves on X, thenA,g > Ap. 


See also Noguchi [4] and Yamanoi [8, 2.2]. They used similar Weil functions to 
define proximity functions relative to ideal sheaves. These are defined as follows. 
Let f:C — X bea holomorphic curve whose image is not entirely contained in Y. 
Then we define the proximity function m + (a,1r) in the usual way: 


20 , dé 
mylar) =f dal flre 


with the obvious adaptations in the case of (2.3). Similarly, if X is a variety over a 
global field k, if P € X(L) for some finite extension L of k, and if S D Soo isa 
finite set of places of k, then 


1 
ms(a, P) = Te 2 ea 
w|S 
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This expression is independent of the choice of L. Again, these proximity functions 
agree (up to O(1)) with m ¢(D,r) and ms(D,71r), respectively, when a = O(—D). 
They also satisfy (3.1) and (3.2) (again, up to O(1)). 


4 Conjectures 


In Nevanlinna theory, we make the following conjecture: 


Conjecture 4.1. Let X be a nonsingular complete complex variety, let K be the 
canonical divisor class on X, let a # (0) be an ideal sheaf on X, let A be a big 
divisor on X, and let € > O. Then there is a proper Zariski-closed subset Z of X, 
depending only on X, a, A, and e, such that if p: B > Cand f: B > X are as in 
(2.3) and the image of f is not contained in Z, then 


Tk, f(r) =F my (a, r) = my(F% (a), r) Sexe € Ta (7) 5 Nram(p) (1) + O(1). 


Here the subscript “exc” means that the inequality holds outside of a set of r of finite 
Lebesgue measure. The implicit constant in O(1) is independent of r but depends 
on all other data. 


The corresponding conjecture in number theory is: 


Conjecture 4.2. Let k be a global field of characteristic zero, let S D Sogo be a finite 
set of places of k, let r be a positive integer, let X be a nonsingular complete variety 
over k, let K be the canonical divisor class of X, let a # (0) be an ideal sheaf on 
X, let A be a big divisor on X, and let € > 0. Then there is a proper Zariski-closed 
subset Z of X, depending only onk, S, X,a,r, A, and €, such that 


hxx(P) +ms(a, P)—ms(-4% (a), P) < €hag(P) + dk(P) + OV) 


forall P € (X \ Z)(k) with [k(P) : k] < r. Again, the implicit constant in O(1) is 
independent of P but depends on all other data. 


These conjectures obviously generalize earlier conjectures in each case. Indeed, 
let D be a normal crossings divisor and let a = @(—D). Then m¢(.% (a), r) = 
O(1) and m ¢(a,r) = m ¢(D,1r), and likewise in the diophantine case. 


Proposition 4.3. Conjectures 4.1 and 4.2 are equivalent to their respective special 
cases in which a = O(—D) with D as above. 


Proof. Let wu: X' > X, Kx-/x, and F be as in the definition of multiplier ideal 
sheaf, and choose 7 > 0 such that .4%~ (a) = .¥%(a'~”). In the Nevanlinna case, let 
g:B — X’ bea lifting of f; then 
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Try, f(r) + me(a,r)—my(F% (a),1r) 
ST k yg (1)—Mg(Kxjx 1) +me(F,r)—mg(—Kyrjxt+|U-n) F].r) 
+0(1) 
= Try g(") + mg (Fra. r) + OC) 
Sexe € T4,¢(7) + Nram(p)(r) + O(1). 


Here we use the fact that 
wr F(a") = pw px Ox(Kxjx — (Ul — a) FI) S Ox (Kx — Ld 0) FJ) 
and therefore 
m ¢(F (a), 7) = mg(Ox(Kyrjx — (1 —n)F]).r) + OC) 
= m,(—Kxjx + [Ud —F],r)+ OU). 
The diophantine case is similar and is left to the reader. oO 


Remark 4.4. Although an arbitrary complete variety may not have a big line sheaf 
(or any nontrivial line sheaf) [1, pp. 25—26 and p. 72], a nonsingular complete variety 
always does. Indeed, let U be a nonempty open affine on a nonsingular complete 
variety X, pick generators x1,..., x, for the affine ring Gy (U), and let D be a Weil 
divisor whose support contains the polar divisors of all x;. Then D is big. 


5 Truncated counting functions 


Variations of the above conjectures using truncated counting functions can also be 
made. Truncated counting functions are defined in Nevanlinna theory as follows. 


Definition 5.1. Let ¢ be a positive integer, let D be an effective divisor on X, and 
let f:C — X bea holomorphic curve whose image is not contained in the support 
of D. Then the truncated counting function for f relative to D is 
NPD, r)= > min{ord, f* D, t} log Fi + minf{ordy) f*D,t}logr 
z 
(cf. (2.1)). In the more general situation of (2.3), the truncated counting function is 
NP, r)= Nis (), where in place of (2.4) we have 


0<|z|<r 


1 r 
NY (r) -_ asl minford, A, t} log ——— 
deg p "oe |p(b)| 


+ > non 4.0} 4967} 


bep—|(0) 
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Still more generally, if a # (0) is an ideal sheaf on X and if f:B > X isa 
holomorphic map whose image is not contained in the closed subscheme associated 
to a, then f*a is a nonzero analytic ideal sheaf on B. Since B is smooth and of 
dimension 1, any such ideal sheaf is of the form @(—A) for an analytic divisor A 
on B, and we define 


Ny(a,r) = Na(r) and NP (a, r)= NYO). 
Finally, we also define a height function 
Ta f(r) = my(a,r) + Ne(a,r). 


The conjecture in Nevanlinna theory is then: 


Conjecture 5.2. Let p:B — C be as in (2.3), let X be a nonsingular complete 
complex variety, let K be the canonical divisor class on X, let a 4 (0) be a sheaf 
of ideals on X, let A be a big divisor on X, and let € > 0. Then there is a proper 
Zariski-closed subset Z of X, depending only on deg p, X, a, A, and e€, such that 
the inequality 


Np (a,r) + Neam(p)(") Zexe Teg (1) + Ta. 0) = Te-(a.6 1) ~€ Tap (0) — OC) 


holds for all nonconstant holomorphic curves f: B — X whose images are not 
contained in Z. The implicit constant in O(1) is as in Conjecture 4.1. 


In the diophantine case, the corresponding definition and conjecture are as 
follows. 


Definition 5.3. Let ¢ be a positive integer, let S D> Soo be a finite set of places of 
k, let D be an effective divisor on X, let Ap be a Weil function for D, let P € 
X(k) \ Supp D, and let L = k(P). Then the truncated counting function of P 
relative to D is 


1 
(t) -_ ‘ 
Ns (D, P) = iL: k] » min{Apyw(P), =f, log lat lw}: 
wtS 


where xz, denotes an element of @; that lies in the prime ideal associated to w, 
but not in the square of that ideal. Note that, because ramification may affect the 
truncation, L is fixed in the above expression. More generally, if a is a nonzero 
ideal sheaf on X and if A, is a Weil function associated to it, then the truncated 
counting function relative to a is defined as 


1 
(t) . 
Ny ’(a, P) = ioe ce 2s min{Agy(P), —t log ||zwllv}, 
wtS 
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and its non-truncated counterpart is defined analogously. Finally, we define a height 
relative to a in the usual way as 


hoa (P) = ms(@, P) + Ns(a,P)= yD Aan (P 


weM, 


Conjecture 5.4. Let k be a global field of characteristic zero, let S D Soo bea finite 
set of places of k, let r be a positive integer, let XY be a nonsingular complete variety 
over k, let K be the canonical divisor class of X, let a # (0) be an ideal sheaf on 
X, let A be a big divisor on X, and let € > 0. Then there is a proper Zariski-closed 
subset Z of X, depending only onk, S, X,a,r, A, and €, such that 


Ng? (a, P) + dk(P) > hee (P) + hae(P) —h.y—(a(P) — €hax(P) — O11) 
for all P € (X \ Z)(k) with [k(P) : k] < r. The implicit constant is as in 
Conjecture 4.2. 


In each case, if a = @(—D) with D a normal crossings divisor, then the above 
conjectures reduce to conjectures that have already been posed; see [6] for the 
diophantine case. 

Again, we have a converse: 


Proposition 5.5. Conjectures 5.2 and 5.4 are equivalent to their respective special 
cases in which a = O(—D) with D as above. 


Proof. In the diophantine case this follows by the same argument as before. Indeed, 
let wu: X’ — X, F, and n be as before; assuming that [6, Conj. 2.3] holds for Freq 
on X’, we have 
Ns? (a, P) + dk(P) = Ny? (Frea, P’) + di (P’) 

> Nky + Fea(P’) — € hysa(P’) — O(1) 

= hy, (P’)- Akyrjy (P’) + hr(P’)- AK yy +l0—m FIP’) 

—eh4(P) — O() 
> hxy(P) + ha(P) —hs-(a(P) —eha(P) — O11), (5.5) 


where P’ € X’ lies over P € X. The proof in the Nevanlinna case is analogous. O 
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Introduction 


The history of Diophantine approximation is quite old: it includes, for instance, 
early estimates for 2, computations related to astronomical studies, and the theory 
of continued fraction expansion. 

There are positive results: any irrational number has good rational approxi- 
mations. One of the simplest tools to see this is Dirichlet’s box principle; other 
methods are continued fraction expansions, Farey series; and geometry of numbers 
(Minkowski’s Theorem). There are negative results: no number has too good (and at 
the same time too frequent) approximations. Some results are valid for all (irrational) 
numbers, others only for restricted classes of numbers, like the class of algebraic 
numbers. There is a metric theory ($1.2) which deals with almost all numbers in the 
sense of the Lebesgue measure. 

One main goal of the theory of Diophantine approximation is to compare, on the 
one hand, the distance between a given real number & and a rational number p/q, 
with, on the other hand, the denominator g of the approximant. An approximation 
is considered sharp if |& — p/q| is small compared to q. This subject is a classical 
one; there are a number of surveys, including those by S. Lang [78, 80-82]. Further 
general references are [26, 36, 46, 59, 60, 68, 75, 124]. 

The works by J. Liouville, A. Thue, C.L. Siegel, FJ. Dyson, A.O. Gel’ fond, 
Th. Schneider and K.F. Roth essentially solve the question for the case where & is 
algebraic. In a different direction, a lot of results are known which are valid for 
almost all numbers, after Khintchine and others. 

Several questions arise in this context. One may consider either asymptotic or 
else uniform approximation. The former asks to only for infinitely many solutions 
to some inequality, while the latter requires that occurrences of such approximations 
be not too lacunary. As a consequence, one introduces in §1.1 two exponents for the 
rational approximation to a single real number &, namely w(&) for the asymptotic 
approximation and @(é) for the uniform approximation; a lower bound for such an 
exponent means that sharp rational approximations exist, an upper bound means that 
too sharp estimates do no exist. To indicate with a “hat” the exponents of uniform 
Diophantine approximation is a convention which originates in [41]. 

In this context a new exponent, v(&), inspired by the pioneering work of R. Apéry 
in 1976 on €(3), has been introduced recently by T. Rivoal and S. Fischler (§1.3). 

After rational approximation to a single real number, several other questions 
naturally arise. One may investigate, for instance, the algebraic approximation 
properties of real or complex numbers, replacing the set of rational numbers by 
the set of real or complex algebraic numbers. Again, in this context, there are two 
main points of view: either one considers the distance |— — a| between the given 
real or complex number & and algebraic numbers a, or else one investigates the 
smallness of | P(&)| for P a non-zero polynomial with integer coefficients. In both 
cases there are two parameters, the degree and the height of the algebraic number 
or of the polynomial, in place of a single one in degree 1, namely q for & — p/q or 
for P(X) = qX — p. Algebraic and polynomial approximations are related: on the 


Recent advances in Diophantine approximation 661 


one hand (Lemma 9), the irreducible polynomial of an algebraic number close to & 
takes a small value at £, while on the other hand (Lemma 10), a polynomial taking a 
small value at & is likely to have a root close to &. However, these connections are not 
completely understood yet: for instance, while it is easy (by means of Dirichlet’s box 
principle — Lemma 15) to prove the existence of polynomials P having small values 
| P(&)| at the point &, it is not so easy to show that sharp algebraic approximations 
exist (cf. Wirsing’s conjecture 37). 

The occurrence of two parameters raises more questions to investigate: often one 
starts by taking the degree fixed and looking at the behaviour of the approximations 
as the height tends to infinity; one might do the opposite, fix the height and let 
the degree tend to infinity: this is the starting point of a classification of complex 
numbers by V.G. Sprindzuk (see [36] Chap. 8 p. 166). Another option is to let the 
sum of the degree and the logarithm of the height tend to infinity: this is the choice of 
S. Lang who introduced the notion of size [79] Chap. V, in connection with questions 
of algebraic independence [78]. 

The approximation properties of a real or complex number & by polynomials of 
degree at most n (§2.3) will give rise to two exponents, w,(€) and @,(&), which 
coincide with w(&) and @(€) for n = 1. Gel’fond’s transcendence criterion (§2.2) is 
related to an upper bound for the asymptotic exponent of polynomial approximation 
@n(€) valid for all transcendental numbers. 

The approximation properties of a real or complex number & by algebraic 
numbers of degree at most 1 (§2.5) will give rise to two further exponents, an 
asymptotic w,*(&) and a uniform @*(&), which also coincide with w(&) and @(&) 
forn = 1. 

For a real number &, there is a third way of extending the investigation of rational 
approximation, which is the study of simultaneous approximation by rational 
numbers of the n-tuple (€, €?,... ,€”). Once more there are an asymptotic exponent 
w/ (€) and a uniform @} (&); again they coincide with w(&) and @(&) forn = 1. 
These two new exponents suffice to describe the approximation properties of a real 
number by algebraic numbers of degree at most 7 (the star exponents), thanks to a 
transference result (Proposition 40) based on the theory of convex bodies of Mahler. 

Several relations among these exponents are known, but a number of problems 
remain open: for instance, for fixed n > | the spectrum of the sextuple 


(on(E), @n(E), 4G), OE), oF E), Oy (E)) € R° 


is far from being completely understood. As we shall see for almost all real numbers 
€ and for all algebraic numbers of degree > n, 


On (&) = On(E) = 0, (€) =O, (€) =n and w,(€) = 0), (§) = 1/n. 


A review of the known properties of these six exponents is given in [41]. We shall 
repeat some of these facts here (beware that our notation for w/ and @/, are the A, 
and de from [41], which are the inverses of their wi and wi , also used by Y. Bugeaud 
in §3.6 of [36]; here we wish to be compatible with the notation of M. Laurent in 
[91] for the higher-dimensional case). 
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Among a number of new results in this direction, we shall describe those 
achieved by D. Roy and others. In particular, a number of results for the case n = 2 
have been recently obtained. 

Simultaneous approximations to a tuple of numbers is the next step. The subspace 
theorem of W.M. Schmidt (Theorem 1B in Chapter VI of [124]), which is a powerful 
generalisation of the Thue—Siegel—Roth Theorem, deals with the approximation of 
algebraic numbers. It says that tuples of algebraic numbers do behave like almost all 
tuples. It is a fundamental tool with a number of deep consequences [29]. Another 
point of view is the metrical one, dealing with almost all numbers. Further questions 
arise which should concern all tuples, and these considerations raise many open 
problems. We shall report on recent work by M. Laurent who introduces a collection 
of new exponents for describing the situation. 

We discuss these questions mainly in the case of real numbers. Most results 
(so far as they are not related to the density of Q into R) are valid also for 
complex numbers with some modifications (however see [40]), as well and for non- 
Archimedean valuations, especially p-adic numbers but also (to some extent) for 
function fields. We make no attempt to be exhaustive. There are a number of related 
issues which we do not study in detail here — sometimes we just give a selection of 
recent references. Among them are 


¢ Questions of inhomogeneous approximation. 

¢ Littlewood’s Conjecture ((36], Chap. 10). 

¢ Measures of irrationality, transcendence, linear independence, algebraic inde- 
pendence of specific numbers. Effective refinements of Liouville’s Theorem are 
studied in [30] (see also Chap. 2 of [36]). 

¢ Results related to the complexity of the development of irrational algebraic 
numbers, automata, normality of classical constants (including irrational alge- 
braic numbers) — the Bourbaki lecture by Yu. Bilu [29] on Schmidt’s subspace 
Theorem and its applications describes recent results on this topic and gives 
further references. 

¢ Connection between Diophantine conditions and dynamical systems. 

e Diophantine questions related to Diophantine geometry. Earlier surveys dealing 
extensively with this issue were written by S. Lang. A recent reference on this 
topic is [70]. 

¢ In a preliminary version of the present paper, the list of topics which were 
not covered included also refined results on Hausdorff dimension, Diophantine 
approximation of dependent quantities and approximation on manifolds, and 
hyperbolic manifolds, also the powerful approach initiated by Dani and Margulis, 
developed by many specialists. We quote here V.V. Beresnevich, V.I. Bernik, 
H. Dickinson, M.M. Dodson, D.Y. Kleinbock, El. Kovalevskaya, G.A. Margulis, 
F. Paulin, S.L. Velani. However, thanks to the contribution of Victor Beresnevich 
and Maurice Dodson who kindly agreed to write Sections 2.7 and 3.6 (and also 
to contribute by adding remarks, especially on §1.2), these topics are no longer 
excluded. 
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We discuss only briefly a few questions of algebraic independence; there is 
much more to say on this matter, especially in connection with Diophantine 
approximation. Although we quote some recent transcendence criteria as well as 
criteria for algebraic independence, we do not fully cover this topic either (and do 
not mention criteria for linear independence). 
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1 Rational approximation to a real number 
1.1 Asymptotic and uniform rational approximation: @ and ® 
Since Q is dense in R, for any — € Rand any « > 0 there exists b/a € Q for which 


<€. 


b 
e-7 
a 


Let us write the conclusion 
la& —b| <a. 


It is easy to improve this estimate: Let a € Zo and let b be the nearest integer to 
aé. Then 
la —b| < 1/2. 


A much stronger estimate is due to Dirichlet (1842) and follows from the box or 
pigeonhole principle; see for instance [78], [60], [124] Chap. I, sTh. 1A: 


Theorem 1 (Uniform Dirichlet’s theorem). For each real number N > 1, there 
exist gq and p in Zwith 1 < q < N such that 


i 
— < — 
Ig& — pl x 


As an immediate consequence ([124] Chap. I Cor. 1B): 


Corollary 2 (Asymptotic Dirichlet’s theorem). /f € € R is irrational, then there 
exist infinitely many p/q € Q for which 


1 
bled 
q q 
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Our first concern is to investigate whether it is possible to improve the uniform 
estimate of Theorem | as well as the asymptotic estimate of Corollary 2. 

We start with Corollary 2. Using either the theory of continued fractions or Farey 
series, one deduces a slightly stronger statement, proved by A. Hurwitz in 1891 
(Theorem 2F in Chap. I of [124]): 


Theorem 3 (Hurwitz). For any real irrational number &, there exist infinitely 


many p/q € Q such that 
1 


J5q2 


For the golden ratio y = (1 + /5)/2 and for the numbers related to the golden 
ratio by a homographic transformation (ax + b)/(cx + d) (where a, b, c, d are 
rational integers satisfying ad — bc = +1), this asymptotic result is optimal. 
For all other irrational real numbers, Hurwitz proved that the constant /5 can be 
replaced by v8 ({124] Chap. I Cor. 6C). These are the first elements of the Lagrange 
spectrum: V5, V8, /221/5, 1517/13, ..., (references are given in Chap.I, §6 of 
[124]; the book [47] is devoted to the Lagrange and Markov spectra). 

Lagrange noticed as early as 1767 (see [59] Chap. 1, Theorem 1.2) that for all 
irrational quadratic numbers, the exponent 2 in q? in the conclusion of Corollary 2 is 
optimal: more generally, Liouville’s inequality (1844) produces, for each algebraic 
number & of degree d > 2, a constant c(£) such that for all rational numbers p/q, 


ah 
q 


q q 
Admissible values for c(&) are easy to specify (Th. 1 Chap. 1 §1 of [128], [60] p. 6, 
Th. 1E of [125], Th. 1.2 of [36]). 
A Liouville number is a real number & for which the opposite estimate holds: for 
any k > 0, there exists a rational number p/q such that 


0<|j&-—] < —- (4) 


q q* 


2 << 


A very well approximable number is a real number & for which there exists k > 2 
such that the inequality (4) has infinitely many solutions. A nice example of such a 
number is 


Co 
25" 

n=1 
for « a real number > 2. This number belongs to the middle-third Cantor set K, 
which is the set of real numbers whose base-three expansions are free of the digit 
1. In [95], J. Levesley, C. Salp, and S.L. Velani show that &, is an element of K 
with irrationality exponent w(&) = « fork > (3+ V5)/2 and > k for2 < K < 
(3 + /5)/2. This example answers a question of K. Mahler on the existence of 
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very well approximable numbers which are not Liouville numbers in K. In [38], 
Y. Bugeaud shows that y(&,) = « fork > 2, and more generally, that for « > 2 and 
A > 0, the number 


Cc 
fie 2 p03 


n=no 


has j1(Ei.) = K. 

Given x = 2, denote by E, the set of real numbers & satisfying the following 
property: the inequality (4) has infinitely many solutions in integers p and q, and 
for any c <1 there exists qo such that, for q = qo, 


e-2 > (5) 
q 


Cc 
q* 


Then for any k > 2 this set FE, is not empty. Explicit examples were given by 
Jarnik in 1931 (see [2] for a variant). In [20], V.V. Beresnevich, H. Dickinson and 
S.L. Velani raised the question of the Hausdorff dimension of the set F',. The answer 
is given by Y. Bugeaud in [35]: this dimension is 2/k. 

We now consider the uniform estimate of Theorem |. Let us show that for any 
irrational number &, Dirichlet’s theorem is essentially optimal: one cannot replace 
1/N by 1/(2N). This was already observed by Khintchine in 1926 [74]: 


Lemma 6. Let & be a real number. Assume that there exists a positive integer No 
such that for each integer N > No, there exista € Zandbé€Zwithl1<a<N 
and 


1 


Then & is rational and aE = b for each N = No. 


Proof. By assumption for each integer N > No there existay € Zand by € Zwith 
1 <ay < WN and 


1 
—b eo" 
lan& n| aN 


Our goal is to check ay & = by foreach N > No. 
Let N > No. Write (a, b) for (ay, by) and (a’, b’) for (ay 41, bn 41): 


(l<a'<N). 


1 1 
—b —i(1<a<N-l), 'E—D! 
lag -b] <= (<a <N=1), [afd < 
Eliminate € between a& — b and a’& — b’: the rational integer 


ab’ —a'b = a(b' —a’é) + a'(a&é —b) 


satisfies |ab’ — a'b| < 1; hence it vanishes and ab’ = a’b. 
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Therefore the rational number by /ay = by+1/ay+ , does not depend on 
N = No. Since 
sim, ban = 
it follows that € = by /ay for all N > No. oO 


Remark. As pointed out to me by M. Laurent, an alternative argument is based on 
continued fraction expansions. 


Coming back to Theorem | and Corollary 2, we associate to each real irrational 
number & two exponents w and @ as follows. 

Starting with Corollary 2, we introduce the asymptotic irrationality exponent of 
a real number &, which is denoted by w(&): 


w(&) = sup \w: there exist infinitely many (p,q) € Z’ 
with q > 1 and 0 < |g&é — p| < qt. 


Some authors prefer to introduce the irrationality exponent (€) = w(—) + 1 of € 
which is denoted by jz(&): 


L(&) = sup {u: there exist infinitely many (p,q) € Z” 


22 
q 


with q > 1 and0 < ag *}. 


An upper bound for w(€) or j2(&) is an irrationality measure for &, namely a lower 
bound for |& — p/q| when p/q € Q. 

Liouville numbers are the real numbers € with w(&) = w(€) = oo. 

Since no set F, (see property (5)) with k > 2 is empty, the spectrum {o(&) ;€e 
R\ Q} of w is the whole interval [1, ++-oo], while the spectrum {(&) ;E€ER\ Q} 
of pz is [2, +00]. 

According to the theorem of Thue—Siegel—Roth [29], for any real algebraic 
number — € R\ Q, 


o(§) =1. 


We shall see (in §1.2) that the same holds for almost all real numbers. 
The other exponent related to Dirichlet’s Theorem | is the uniform irrationality 
exponent of &, denoted by @(é): 


@(€) = sup {w: for any N > 1, there exists (p,q) € Z” 


with | <q <N and0 <|gé—p| < nh. 
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In the singular case of a rational number € we set w(€) = @(&) = 0. It is plain from 
the definitions that for any € € R \ Q, 


o(§) 2 Oo) 2 1. 


In fact Lemma 6 implies that for any § € R\ Q, @(€) = 1. Our motivation to 
introduce a notation for a number which is always equal to 1 is that it will become 
non-trivial in more general situations (@, in §2.3, @/, in §2.4, O* in §2.5). 


1.2. Metric results 


The metric theory of Diophantine approximation provides statements which are 
valid for almost all (real or complex) numbers, which means for all numbers outside 
a set of Lebesgue measure 0. Among many references on this topic, we quote 
[26, 36, 69, 129, 130]. See also §2.7 and §3.6 below. 

One of the early results is due to Capelli: for almost all € € R, 


w(&) = O(§)=1 and pe) =2. 


This is one of many instances where irrational algebraic numbers behave like almost 
all numbers. However one cannot expect that all statements from Diophantine 
approximation which are satisfied by all numbers outside a set of measure 0 will 
be satisfied by all irrational algebraic numbers, just because such an intersection of 
sets of full measure is empty. As pointed out to me by B. Adamczewski, S. Schanuel 
(quoted by S. Lang in [78] p.184 and [83] Chap. II §2 Th. 6) gave a more precise 
formulation of such a remark as follows. 

Denote by K (for Khintchine) the set of non-increasing functions VY from Rs to 
R,o. Set 


WeK; > W(n) diverges 


n21 


Ke={sWVeEK; 2 W(n) comers , Ke= 


n21 


HenceK = K,UKy. 

A well-known theorem of A. Ya. Khintchine in 1924 (see [73], [75] and Th. 
1.10 in [36]) has been refined as follows (an extra condition that the function x t~ 
x?W(x) be decreasing has been dropped) — see Beresnevich, Dickinson, and Velani 
[21]: 


Theorem 7 (Khintchine). Let Y ¢€ K. Then for almost all real numbers &, the 
inequality 
lag — p| < ¥@) (8) 
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has 


¢ only finitely many solutions in integers p and q ifW € K, 
¢ infinitely many solutions in integers p and q if € Kg. 


S. Schanuel proved that the set of real numbers which behave like almost all 
numbers from the point of view of Khintchine’s theorem in the convergent case has 
measure 0). More precisely, the set of real numbers & such that for any smooth convex 
function W € K,, the inequality (8) has only finitely many solutions is the set of real 
numbers with bounded partial quotients (badly approximable numbers — see [124] 
Chap. I §5; other characterisations of this set are given in [83] Chap. II §2 Th. 6). 

Moreover B. Adamczewski and Y. Bugeaud noticed that given any irrational &, 
either there exists a W € Ky for which 


Ig& — p| < Yq) 


has no integer solutions or there exists a W € K,. for which 


Ig& — p| < ¥(q) 


has infinitely many integer solutions. 


1.3. The exponent v of S. Fischler and T. Rivoal 


Let € € R\ Q. In [63], S. Fischler and T. Rivoal introduce a new exponent v(&) 
which they define as follows. 

When u = (u,)n>1 iS an increasing sequence of positive integers, define another 
sequence of integers v = (Vn)n>1 by |unE —vn| < 1/2 (Le. v, is the nearest integer 
to u,&) and set 


a; (u) = lim sup [ene = Vasil B(u) = lim sup 


Mn+) | 
noo |uné = Vn | noo =Un 


Then 


v(&) = inflog ae (uw) Bw), 


where u ranges over the sequences which satisfy a¢(u) < 1 and B(u) < +oo. Here 
we agree that inf @ = +o. 
They establish a connection with the irrationality exponent by proving 


_ logBW 


mG) <1 log a (u) 


As a consequence, if v(€) < +00, then (&) < +00. 
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If € is quadratic, Fischler and Rivoal produce a sequence u with ag (u)B(u) = 1, 
hence v(&) = 0. 

This new exponent v is motivated by Apéry-like proofs of irrationality and 
measures. Following the works of R. Apéry, A. Baker, F. Beukers, G. Rhin and 
C. Viola, and M. Hata among others, S. Fischler and T. Rivoal deduce 


v(2'/3) < (3/2)log2, v(£(3)) <3, v(n?) <2, v(log2) <1 


Also v(zr) < 21 

The spectrum of v(&) is not yet known. According to [63], for any § €R\Q, 
the inequalities 0 < v(€) < +00 hold. Further, for almost all § € R, v(&) = 0. 
Furthermore, S. Fischler and T. Rivoal, completed by B. Adamczewski [1], proved 
that any irrational algebraic real number & has v(&E) < +00. 

There are examples of € € R \ Q for which v(&) = +00, but all known examples 
with v(&) = +00 so far have (€) = +00 

Fischler and Rivoal ask whether it is true that v(€) < +00 implies u(&) = 2. 
Another related question they raise in [63] is whether there are numbers & with 
0 < v(E) < +00. 


2 Polynomial, algebraic, and simultaneous approximation 
to a single number 


We define the (usual) height H(P) of a polynomial 
P(X) = dp tayX +-:-+a,X" 


with complex coefficients as the maximum modulus of its coefficients, while its 
length L(P) is the sum of the moduli of these coefficients: 


H(P) = qmax ax |ai|, L(P) = Die. 


The height H(@) and length L(q@) of an algebraic number « are the height and length 
of its minimal polynomial over Z. 


2.1 Connections between polynomial approximation 
and approximation by algebraic numbers 


Let & be a complex number. To produce a sharp polynomial approximation to & is 
to find a non-zero polynomial P € Z[X] for which | P(€)| is small. An algebraic 
approximation to & is an algebraic number a@ such that the distance |F — a| between 
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— and a is small. There are close connections between both questions. On the one 
hand, if |P(&)| is small, then € is close to a root a of P. On the other hand, if 
|& — a| is small then the minimal polynomial of ~ assumes a small value at &. These 
connections explain that the classifications of transcendental numbers in S, 7, and 
U classes by K. Mahler coincide with the classifications of transcendental numbers 
in S*, 7*, and U* classes by J.F. Koksma (see [128] Chap. III and [36] Chap. 3). 

The easy part is the next statement (Lemma 15 Chap. III §3 of [128], §15.2.4 of 
[138], Prop. 3.2 §3.4 of [36]). 


Lemma 9. Let f € C[X] be a non-zero polynomial of degree D and length L, let 
a € Cbearoot of f and let & € C satisfy |& —a| < 1. Then 


|fG)| <|§ —@|LDC + |g). 


The other direction requires more work (see Chap. III §3 of [128], §3.4 of [36]). 
The next result is due to G. Diaz and M. Mignotte [52] (cf. Lemma 15.13 of [138]). 


Lemma 10. Let f € Z[X] be a non-zero polynomial of degree D. Let & be a 
complex number, a a root of f at minimal distance of — and k the multiplicity of a 
as a root of f. Then 


lf — alk < DPCH(f)? | FOI 


Further similar estimates are due to M. Amou and Y. Bugeaud [7]. 


2.2 Gel’fond’s transcendence criterion 


The so-called transcendence criterion, proved by A.O. Gel’fond in 1949, is an 
auxiliary result in the method he introduced in [64—66] (see also [67] and [68]) for 
proving algebraic independence results. An example is the algebraic independence 
of the two numbers 2 ¥? and 24. More generally, he proved that if is a non—zero 
algebraic number, log a a non-zero logarithm of « and B an algebraic number of 
degree d > 3, then at least 2 among the d — \ numbers 


are algebraically independent. Here a* stands for exp(z log a). 

While the Gel’ fond—Schneider transcendence method for solving Hilbert’s sev- 
enth problem on the transcendence of «? relies on a Liouville type estimate, namely 
a lower bound for a non-zero value | P(€)| of a polynomial P at an algebraic point &, 
Gel’ fond’s method for algebraic independence requires a more sophisticated result, 
namely the fact that there is no non-trivial uniform sequence of polynomials taking 
small values at a given transcendental number. 

Here is a version of this transcendence criterion [93, 131]. 
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Theorem 11 (Gel’fond’s transcendence criterion). Let & € C. Assume there is a 
sequence (Py) n>N, Of non-zero polynomials in Z[X], where Py has degree < N 
and height H(Py) < e%, for which 


|PwiG)] se. 
Then & is algebraic and Py(&) = 0 forall N = No. 


Proof (sketch of). The idea of the proof is basically the same as for Lemma 6 which 
dealt with degree-1 polynomials: one eliminates the variable using two consecutive 
elements of the sequence of polynomials. In degree | linear algebra was sufficient. 
For higher degrees the resultant of polynomials is a convenient substitute. 

Fix N > No. Since | Px (&)| is small, € is close to a root ay of Py, hence Py is 
divisible by a power Qy of the irreducible polynomial of ay and |Q(&)| is small. 
The resultant of the two polynomials Oy and Oy + has absolute value < 1; hence 
it vanishes and therefore a, does not depend on NV. oO 


In 1969, H. Davenport and W.M. Schmidt ([49] Theorem 2b) prove the next 
variant of Gel’fond’s transcendence criterion, where now the degree is fixed. 


Theorem 12 (Davenport and Schmidt). Let € be a real number and n>2 a 
positive integer. Assume that for each sufficiently large positive integer N there 
exists a non-zero polynomial Py € Z[X] of degree <n and usual height < N for 
which 

|Py(a)| << NO-"TT, 


Then & is algebraic of degree < n. 


The next sharp version of Gel’fond’s transcendence criterion 11, restricted to 
quadratic polynomials, is due to B. Arbour and D. Roy, 2004 [8]. 


Theorem 13 (Arbour and Roy). Let & be a complex number. Assume that there 
exists No > 0 such that for any N > No, there exists a polynomial Py € Z[X] of 
degree < 2 and height < N satisfying 


1 
Pu) <gNO. 


Then & is algebraic of degree < 2. and Py(&) = O forall N = No. 


Variants of the transcendence criterion have been considered by D. Roy in 
connection with his new approach towards Schanuel’s conjecture [138] §15.5.3: 


Conjecture 14 (Schanuel). Let x),...,X, be Q-linearly independent complex 
numbers. Then n at least of the 2n numbers x,,...,Xn, e“!,...,e" are alge- 
braically independent. 


In [111, 112], D. Roy states a Diophantine approximation conjecture, which he 
shows to be equivalent to Schanuel’s Conjecture 14. 
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Roy’s conjecture involves polynomials for which bounds for the degree, height, 
and absolute values at given points are assumed. The first main difference with 
Gel’fond’s situation is that the smallness of the values is not as strong as what 
is achieved by Dirichlet’s box principle when a single polynomial is constructed. 
Hence elimination arguments cannot be used without further ideas. On the other 
hand, the assumptions involve not only one polynomial for each 7 as in Theorem 11, 
but a collection of polynomials, and they are strongly related. This new situation 
raises challenging questions on which some advances have already been achieved. 
In particular in [93] Laurent and Roy obtain variants of Gel’fond’s criterion 11 
involving multiplicities. Further progress has been subsequently made by D. Roy 
in [120, 121]. 

We shall consider extensions of the transcendence criterion to criteria for 
algebraic independence in §3.1. 


2.3. Polynomial approximation to a single number 


A simple application of Dirichlet’s box principle (see the proof of Lemma 8.1 in 
[36]) yields the existence of polynomials with small values at a given real point: 


Lemma 15. Let & be a real number and n a positive integer. Set c=(n + 1) 
max{1, |&|}”. Then, for each positive integer N, there exists a non-zero polynomial 
P €Z|X], of degree < n and usual height H(P) < N, satisfying 


|P@)|<cn™. 


Variants of this lemma rely on the geometry of numbers: for instance from Th. 
B2 in [36] one deduces that in the case 0 < |&| < 1/2,if N > 2, then the conclusion 
holds also with c = 1 (see the proof of Prop. 3.1 in [36]). 

Theorem | in §1 yields a refined estimate for the special case n = 1. The 
statement is plain in the case where & is algebraic of degree < n as soon as N 
exceeds the height of the irreducible polynomial of &; this is why when n is fixed 
we shall most often assume that € is either transcendental or else algebraic of 
degree > n. 

The fact that the exponent n in Lemma 15 cannot be replaced by a larger 
number (even if we seek such a solution only for infinitely many NV) was proved by 
Sprindzuk [129], who showed in 1965 that for & outside a set of Lebesgue measure 
zero and for each € > 0, there are only finitely many non-zero integer polynomials 
of degree at most n with 


|P(§)| <H(Py"™. 


We introduce, for each positive integer n and each real number £, two exponents 
w,(€) and @, () as follows. 
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The number q,,(€) denotes the supremum of the real numbers w for which there 
exist infinitely many positive integers N for which the system of inequalities 


0 < |xo txpE tes + xnE"|< N, imax xi] < N (16) 


has a solution in rational integers xo, x1,...,X,. The inequalities (16) can be written 
0<|PE)|<H(P)™, 


where P denotes a non-zero polynomial with integer coefficients and degree < n. 
A transcendence measure for & is a lower bound for | P(&)| in terms of the height 
H(P) and the degree deg P of P. Hence one can view an upper bound for w, (&) as 
a transcendence measure for &. 

These numbers arise in Mahler’s classification of complex numbers ([128] 
Chap. II §1 and [36] §3.1). 

A uniform version of this exponent is the supremum @, (&) of the real numbers w 
such that, for any sufficiently large integer NV, the same system (16) has a solution. 
An upper bound for @, (€) is a uniform transcendence measure for &. 

Clearly, from the definitions, we see that these exponents generalize those from 
$1.1: for n = 1, w,(€) = w(€) and @,(€) = @(E). From Lemma 15 one deduces, 
for any n = | and any € € R which is not algebraic of degree < n, 


nes On (E) < @y (&). (17) 


Moreover, @, < @n+1 and O, < @n+1. As a consequence, Liouville numbers have 
@n(&) = +00 foralln = 1. 

The value of the exponents w, and @, for almost all real numbers and for 
all algebraic numbers of degree > n is n. The following metric result is due to 
V.G. Sprindzuk [129]: 


Theorem 18 (Sprindzuk). For almost all numbers & € R, 
On (€) = On (§) =n foralln > 1. 


As a consequence of W.M. Schmidt’s subspace theorem one deduces (see [36] 
Th. 2.8 and 2.9) the value of w, (€) and ©, (&) for € algebraic irrational: 


Theorem 19 (Schmidt). Let n > 1 be an integer and & an algebraic number of 
degree d > n. Then 
Qn(€) = On(&) Hn. 


The spectrum of the exponent w, is [n, +00]. For @, it is not completely known. 
From Theorem 12 one deduces: 


Theorem 20 (Davenport and Schmidt). For any real number & which is not 
algebraic of degree <n, 
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For the special case n = 2 a sharper estimate holds: from Theorem 13 of 
B. Arbour and D. Roy one deduces 


@2(&) <yt+1 (21) 


(recall that y denotes the golden ratio (1 + /5)/2). 
In [49], Davenport and Schmidt comment: 


“We have no reason to think that the exponents in these theorems are best possible.” 


It was widely believed for a while that ©2(€) would turn out to be equal to 2 for 
all € € R which are not rational or quadratic irrationals. Indeed, otherwise, for « in 
the interval 2 < x < @(6), the inequalities 


0 <|xo +.xi& + x28*|<cN™, maxf{|xol, [xil, |xal}<N, (22) 


would have, for a suitable constant c > O and for all sufficiently large N, 
a non-trivial solution in integers (x9,x1,X2) € Z. However, these inequalities 
define a convex body whose volume tends to zero as N tends to infinity. In such 
circumstances one does not expect a non-trivial solution to exist.' In general 
@2(a, B) may be infinite (Khintchine, 1926; see [46]). However, here we have the 
restriction B = a”. 

Hence it came as a surprise in 2003 when D. Roy [113] showed that the estimate 
(21) is optimal, by constructing examples of real (transcendental) numbers & for 
which @2(€) = y + 1 =2.618.... 

By means of a transference principle of Jarnik (Th. 2 of [72]), Th. 1 of [115] can 
be reformulated as follows (see also Th. 1.5 of [116]). 


Theorem 23 (Roy). There exists a real number & which is neither rational nor 
a quadratic irrational and which has the property that for a suitable constant 
c > 0, for all sufficiently large integers N, the inequalities (22) have a solution 
(X0,X1,X2) € ZL with k = y + 1. Any such number is transcendental over Q and 
the set of such real numbers is countable. 


In [118], answering a question of Y. Bugeaud and M. Laurent [41], D. Roy shows 
that the exponents @2(&), where € ranges through all real numbers which are not 
algebraic of degree < 2, form a dense subset of the interval [2, 1 + y]. 

D. Roy calls extremal a number which satisfies the conditions of Theorem 23; 
from the point of view of approximation by quadratic polynomials, these numbers 
present a closest behaviour to quadratic real numbers. 

Here is the first example [113] of an extremal number &. Recall that the Fibonacci 
word 

w = abaababaabaababaababaabaababaabaab... 


'Compare with the definition of singular systems in §7, Chap. V of [46]. 
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is the fixed point of the morphism a +> ab, b +» a. It is the limit of the sequence of 
words starting with fj = b and f: = a and defined inductively by concatenation 
as fn = fn—1fn—2. Now let A and B be two distinct positive integers and let 
— € (0, 1) be the real number whose continued fraction expansion is obtained from 
the Fibonacci word w by replacing the letters a and b by A and B: 


[0; A, B, A, A, B, A, B, A, A, B, A, A, B, A, B, A, A,...]. 


Then & is extremal. 
In [113, 115,119], D. Roy investigates the approximation properties of extremal 
numbers by rational numbers, by quadratic numbers as well as by cubic integers. 


Theorem 24 (Roy). Let & be an extremal number. There exist positive constants 
C1,...,¢5 with the following properties: 


(1) For any rational number a € Q we have 
|§ —a| = c.H~? (log H)-® 


with H = max{2,H(a)}. 
(2) For any algebraic number a of degree at most 2 we have 


| —a| > c3H(a)?”. 

(3) There exist infinitely many quadratic real numbers a with 
| —a| < cyH(a)?”?. 

(4) For any algebraic integer a of degree at most 3 we have 
|E — a| > csH(a)”~. 


Moreover, in [114], he shows that for some extremal numbers &, property (4) 
holds with the exponent —y — | in place of —y — 2. 

In [116] D. Roy describes the method of Davenport and Schmidt and he gives a 
sketch of proof of his construction of extremal numbers. 

In [119] he gives a sufficient condition for an extremal number to have bounded 
quotients and constructs new examples of such numbers. 

The values of the different exponents for the extremal numbers which are 
associated with Sturmian words (including the Fibonacci word) have been obtained 
by Y. Bugeaud and M. Laurent [41]. Furthermore, they show that the spectrum 
{@(E); € € R\Q} is not countable. See also their joint works [42,43]. Their method 
involves words with many palindromic prefixes. S. Fischler in [61,62] defines new 
exponents of approximation which allow him to obtain a characterization of the 
values of @2(&) obtained by these authors. 
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In [3], B. Adamczewski and Y. Bugeaud prove that for any extremal number &, 
there exists a constant c = c(&) such that for any integer n > 1, 


@n(&) < exp{c(log(3n)) (log log(3n))?. 


In particular, an extremal number is either a S-number or a T-number in Mahler’s 
classification. 

Recent results on simultaneous approximation to a number and its square, on 
approximation to real numbers by quadratic integers and on quadratic approxima- 
tion to numbers associated with Sturmian words have been obtained by M. Laurent, 
Y. Bugeaud, S. Fischler, D. Roy, and other. 


2.4 Simultaneous rational approximation to powers of a real 
number 


Let € be a real number and n a positive integer. 

We consider first the simultaneous rational approximation of successive powers 
of &. We denote by @/ (€) the supremum of the real numbers w for which there exist 
infinitely many positive integers N for which the system 


0 < max |x; —x0&’'|< N~”, with max |x;| < N, (25) 
1<i<n O<i <n 
has a solution in rational integers Xo, X1,...,Xn. 


An upper bound for w/(&) yields a simultaneous approximation measure for 
Ee asege" 

Next the uniform simultaneous approximation measure is the supremum } (€) 
of the real numbers w such that for any sufficiently large integer NV, the same system 
(25) has a solution in rational integers x9, X1,...,Xn- 

Notice that for n = 1, | (&) = w(&) and 0 (€) = @(€). 

According to Dirichlet’s box principle, for all € and n, 


1 “aI ! 
= On (&) ~ On, (&). 
nN 


Khintchine’s transference principle (see Th. B.5 in [36] and Theorem 61 below) 
yields relations between w/, and w,. As remarked in Theorem 2.2 of [41], the same 
proof yields similar relations between @}, and @. 


Theorem 26. Let n be a positive integer and & a real number which is not algebraic 


of degree < n. Then 


1 an (&) ; =n 
7 = (n _ 1)@,(&) +n < w, (§) < ———— 
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and 


n ~ (n—Don(&) +n n 

The second set of inequalities follows from the inequalities (4) and (5) of 
V. Jarnik in Th. 3 of [72], with conditional refinements given by the inequalities 
(6) and (7) of the same theorem. 

In particular, w,(€) = n if and only if w/ (€) = 1/n. Also, ©, (€) = n if and 
only if @/,(€) = 1/n. 

The spectrum of w/ (&), where & ranges over the set of real numbers which are 
not algebraic of degree < n, is investigated by Y. Bugeaud and M. Laurent in [42]. 
Only the case n = 2 is completely solved. 

It follows from Theorem 18 that for almost all real numbers &, 


1 
ow) (€) =o) (§) = —  foralln > 1. 
n 


Moreover, a consequence of Schmidt’s theorem 19 is that for alln = 1 and for all 
algebraic real numbers & of degree d > n, 


j ~~ 1 1 
o, (§) = @,(€) = > w,(E) 


Theorems 2a and 4a of the paper [49] by H. Davenport and W.M. Schmidt (1969) 
imply that upper bounds for @/ (&) are valid for all real numbers & which are not 
algebraic of degree < n. For instance, 


Oi(E)=1, GE) <1/y =0.618..., &4(€) < 1/2. 


A slight refinement was obtained by M. Laurent [88] in 2003 (for the odd values of 
n= 5). 


Theorem 27 (Davenport and Schmidt, Laurent). Let § €R\Q and n=2. 
Assume & is not algebraic of degree < [n/2]. Then 


2/n ifn is even, 


A) =1. 
@,(§) < [n/2|~ = 2/(n+1)_ ifn is odd. 


The definition of @/, with a supremum does not reflect the accuracy of the results 
in [49]; for instance, the upper bound @4(&) < 1/y is not as sharp as Theorem la 
of [49] which is the following: 


Theorem 28 (Davenport and Schmidt). Let & be a real number which is not 
rational or a quadratic irrational. There exists a constant c > 0 such that for 
arbitrarily large values of N, the inequalities 


max{|x, — xo€|, |x2 — x0E7|}<cN 1”, |x0| < N, 


have no solution (Xo, X1,X2) € Z. 


678 M. Waldschmidt 


Before restricting ourselves to the small values of 7, we emphasise that there is 
a huge lack in our knowledge of the spectrum of the set 


(on(&), On(&), w (6), 01(6)) € RB’, 


where & ranges over the set of real numbers which are not algebraic of degree < n. 

Consider the special case n = 2 and the question of quadratic approximation. As 
pointed out by Y. Bugeaud, a formula due to V. Jarnik (1938) (Theorem 1 of [72]; 
see also Corollary A3 in [118] and [91]) relates @ and 0: 


rT a oe 
o(€) = 1 mn®) (29) 


Therefore the properties of ©) which we considered in §2.3 can be translated into 
properties of '. For instance, @5(€) = 1/2 if and only if @2(&) = 2, and this 
holds for almost all € € R (see Theorem 18) and for all algebraic real numbers & of 
degree > 3 (see Theorem 19). If € € R is neither rational nor a quadratic irrational, 
Davenport and Schmidt have proved 


O5(E) < 1/y = 0.618.... (30) 


The extremal numbers of D. Roy in Theorem 23 satisfy @5(€) = 1/y. More 
precisely, they are exactly the numbers € € R which are not rational or quadratic 
irrationals and satisfy the following property: there exists a constant c > 0 such that 
for any sufficiently large number N, the inequalities 


max{|x1 — xo|,|x2 — xo£?|}<cN7/”, 0 < max{|xol, [xi], |xol} < N, 


have a solution in rational integers Xo, X1, X2. (This was the original definition). 

In [118], using Jarnik’s formula (29), D. Roy shows that the set of 
(O5(E), O5(E)) € R*, where & ranges over the set of real numbers which are not 
algebraic of degree < 2, is dense in the piece of curve 


l= aetere yet: 


We conclude with the case n = 3 and the question of cubic approximation. When 
— € R is not algebraic of degree < 3, the estimate for @(€) by Davenport and 
Schmidt [49] is 


Te 1 
oe er, 
3 <3) < 5 


As we have seen, the lower bound is optimal (equality holds for almost all numbers 
and all algebraic numbers of degree > n). The upper bound has been improved by 
D. Roy in [119] 


= 1 
O5(6) < 5Qy +1— V4y? + 1) = 0.4245... 
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2.5 Algebraic approximation to a single number 


Let € be a real number and 7 a positive integer. 

Denote by w;*(&) the supremum of the real numbers w for which there exist 
infinitely many positive integers N with the following property: there exists an 
algebraic number a of degree < n and height < N satisfying” 


0<|f-a|\<N!. (31) 


An upper bound for w*(&) is a measure of algebraic approximation for &. These 
numbers arise in Koksma’s classification of complex numbers (Chap. III §3 of [128] 
and §3.3 of [36]). 

Next, denote by @*(&) the supremum of the real numbers w such that, for any 
sufficiently large integer N, there exists an algebraic number a of degree < n and 
height < N satisfying 

|§ —a| < H(a)'N™. 


An upper bound for @*(€) yields a uniform measure of algebraic approximation 
for &. 

From Schmidt’s subspace theorem one deduces, for a real algebraic number & of 
degree d and forn > 1, 


@; (&) = o, (€) = min{n, d — 1}. 


See [36] Th. 2.9 and 2.11. 

That there are relations between w, and w;* (and, for the same reason, between 
@» and or ) can be expected from Lemmas 9 and 10. Indeed, a lot of information on 
these numbers has been devised in order to compare the classifications of Mahler 
and Koksma. The estimate 


Qn (€) Z op (€), 


which follows from Lemma 9, was known by Koksma (see also Wirsing’s paper 
[139]). In the reverse direction, the inequalities 


of >on —nt1, af e> AF" (32) 
i (é) 
* Mn 
wy (E) 2 we) =n (33) 


were obtained by E. Wirsing in 1960 [139] (see §3.4 of [36]). 


The occurrence of —1 in the exponent of the right-hand side of (31) is already plain for degree 1 
polynomials, comparing ja — p/q| and |ga — p|. 
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A consequence is that for a real number & which is not algebraic of degree < n, 
if @,(€) = n then w*(&) =n. 
The inequality (33) of Wirsing has been refined in Theorem 2.1 of [41] as follows. 


Theorem 34 (Bugeaud and Laurent). Let n be a positive integer and & a real 
number which is not algebraic of degree <n. Then 


on (&) 


= On(é) 
Oe) =H el and 0, (§) > = 


B*(E) > i, 

n (§) (@—ntl 

A number of recent papers are devoted to this topic, including the survey given 

in the first part of [41] as well as Bugeaud’s papers [5, 31, 34, 37,39], where further 
references can be found. 

We quote Proposition 2.1 of [41], which gives connections between the six 


ia je AP co, ee 
exponents @y, On, 0), @,, 0, ,O,- 


Proposition 35. Let n be a positive integer and & a real number which is not 
algebraic of degree < n. Then 


“ < 0/(E) < min{1,o/(6)} 
and 


1 < @7(&) < min{w; (&), On(&)} < max{o; (&), On(E)} < On(E). 


A further relation connecting w;* and @/, has been discovered by H. Davenport 
and W.M. Schmidt in 1969 [49]. We discuss their contribution in §2.6. For our 
immediate concern here we only quote the following result: 


Theorem 36. Let n be a positive integer and — a real number which is not algebraic 
of degree < n, Then 


on ()@,, (€) = 1. 


The spectral question for w* is one of the main challenges in this domain. 
Wirsing’s conjecture states that for any integer n > 1 and any real number € which 
is not algebraic of degree < n, we have w* (&) = n. In other terms: 


Conjecture 37 (Wirsing). For any € > 0 there is a constant c(&,n,€) > 0 for 
which there are infinitely many algebraic numbers a of degree < n with 


lf —a| <c(,n,—)H(@y "IFS. 


In 1960, E. Wirsing [139] proved that for any real number which is not algebraic 
of degree < n, the lower bound w*(&) > (n + 1)/2 holds: it suffices to combine 
(32) with the lower bound w, (§) = n from (17) (see [124] Chap. VHI Th. 3B). More 
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precisely, he proved that for such a € € R there is a constant c(E,n) > 0 for which 
there exist infinitely many algebraic numbers a of degree < n with 


|€ —a| < c(E,n)H(a) 9)”, 


The special case n = 2 of this estimate was improved in 1967 when H. Davenport 
and W.M. Schmidt [48] replaced (n + 3)/2 = 5/2 by 3. This is optimal for 
the approximation to a real number by quadratic algebraic numbers. This is the 
only case where Wirsing’s conjecture is solved. More recent estimates are due to 
V.I. Bernik and K. Tishchenko [28, 132-136]. This question is studied by Y. Bugeaud 
in his book [36] (§3.4) where he proposes the following main problem: 


Conjecture 38 (Bugeaud). Let (wy, )n>1 and (w*)n>1 be two non—decreasing se- 
quences in [1, +00] for which 


* 


n<we<w,<we+n—1 foranyn=1. 


Then there exists a transcendental real number & for which 
On(E) =Wn and w7(€)=w, foranyn > 1. 


A summary of known results on this problem is given in §7.8 of [36]. 
The spectrum 


{on(&) — oF (E); & € R not algebraic of degree < n} C [0,n — 1] 


of @, —@,; forn > 2 was studied by R.C. Baker in 1976 who showed that it contains 
[0, 1—(1/n)]. This has been improved by Y. Bugeaud in [34]: it contains the interval 
[0,7 /4]. 

Most results concerning w* (&) and @* (€) for € € R have extensions to complex 
numbers, only the numerical estimates are slightly different. However, see [40]. 


2.6 Approximation by algebraic integers 


An innovative and powerful approach was initiated in the seminal paper [49] by 
H. Davenport and W.M. Schmidt (1969). It rests on the transference principle arising 
from the geometry of numbers and Mahler’s theory of polar convex bodies and 
allows one to deal with approximation by algebraic integers of bounded degree. The 
next statement includes a refinement by Y. Bugeaud and O. Teulié (2000) [45] who 
observed that one may treat approximations by algebraic integers of given degree; 
the sharpest results in this direction are due to M. Laurent [88]. 

From the estimate w*(&)@/,(€) > 1 in Theorem 36 one deduces the following 
statement. Let 1 be a positive integer and let € be a real number which is not 
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algebraic of degree < n. Let A satisfy @/,(€) < A. Then for « = (1/A) + 1, there is 
a constant c(n, &,«) > 0 such that the equation 


|§ —a| <c(n, &,«)H(a) * (39) 


has infinitely many solutions in algebraic numbers @ of degree n. In this statement 
one may replace “algebraic numbers a of degree n” by “algebraic integers a of 
degree n + 1” and also by “algebraic units a of degree n + 2.” 


Proposition 40. Let « > 1 be a real number, n a positive integer and & be a real 
number which is not algebraic of degree <n. Assume 01, (&) < 1/(k — 1). Then 
there exists a constant c(n,&,«) > 0 such that there are infinitely many algebraic 
integers a of degree n + 1 satisfying (39) and there are infinitely many algebraic 
units a of degree n + 2 satisfying (39). 


Suitable values for « are deduced from Theorem 27 and estimate (21). For 
instance, from Theorem 36 and the estimate 04 (&) < 1/y of Davenport and Schmidt 
in (30) one deduces w}(&) > y. Hence for any x < 1 + y the assumptions of 
Proposition 40 are satisfied. More precisely, the duality (or transference) arguments 
used by Davenport and Schmidt to prove Theorem 36 together with their Theorem 
28 enabled them to deduce the next statement ([49], Th. 1). 


Theorem 41 (Davenport and Schmidt). Let & € R be a real number which is 
neither rational nor a quadratic irrational. Then there is a constant c > 0 with the 
following property: there are infinitely many algebraic integers a of degree at most 
3 which satisfy 

0 <|&—a| < cH(a)”™. (42) 


Lemma 9 shows that under the same assumptions, for another constant c > 0 
there are infinitely many monic polynomials P € Z[X] of degree at most 3 satisfying 


|P(é)| < cH(P)”. (43) 


Estimates (42) and (43) are optimal for certain classes of extremal numbers [114]. 
Approximation of extremal numbers by cubic integers is studied by D. Roy in [114, 
115]. Further papers dealing with approximation by algebraic integers include [4, 
45, 122, 135]. 

Another development of the general and powerful method of Davenport and 
Schmidt deals with the question of approximating simultaneously several numbers 
by conjugate algebraic numbers: this is done in [122] and refined in [117] by D. Roy. 
Also in [117] D. Roy gives variants of Gel’ fond’s transcendence criterion involving 
not only a single number & but sets {y + &1,...,v + &n} or {yé&,..., v&n}. In two 
recent manuscripts [120, 121], D. Roy produces new criteria for the additive and for 
the multiplicative groups. 

A different application of transference theorems is to link inhomogeneous 
Diophantine approximation problems with homogeneous ones [42]. 
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2.7 Overview of metrical results for polynomials 


Here we give a brief account of some significant results that have produced new 
ideas and generalisations, as well as some interesting problems and conjectures. 
We begin with the probabilistic theory (that is, Lebesgue measure statements) 
and continue with the more delicate Hausdorff measure/dimension results. Results 
for multivariable polynomials, in particular, the recent proof of a conjecture of 
Nesterenko on the measure of algebraic independence of almost all real m-tuples, 
will be sketched in §3.6, as will metrical results on simultaneous approximation. 
Note that many of the results suggested here have been established in the far 
more general situation of Diophantine approximation on manifolds. However, for 
simplicity, we will only explain this Diophantine approximation for the case of 
integral polynomials. 

Mahler’s problem [96], which arose from his classification of real (and complex) 
numbers, remains a major influence over the metrical theory of Diophantine 
approximation. As mentioned in §2.3, the problem was settled by Sprindzuk in 
1965. Answering a question posed by A. Baker in [9], Bernik [25] established a 
generalisation of Mahler’s problem akin to Khintchine’s one-dimensional conver- 
gence result in Theorem 7, involving the critical sum 


> wh) (44) 


h=1 


of values of the function Y : N — R* that defines the error of approximation. 


Theorem 45 (Bernik, 1989). Given a monotonic W such that the critical sum (44) 
converges, for almost all — € R the inequality 


|P(&)| < H(P)"*'W(A(P)) (46) 


has only finitely many solutions in P € Z|x] with deg P <n. 


In the case n = 1, inequality (46) reduces to rational approximations of real 
numbers and is covered by Khintchine’s Theorem 7 [73]. Khintchine’s Theorem 
7 also covers the solubility of (46) when n = 1 and (44) diverges. For arbitrary 
n the complementary divergence case of Theorem 45 has been established by 
Beresnevich, who has shown in [13] that if (44) diverges then for almost all real 
& inequality (46) has infinitely many solutions P € Z[x] with deg P = n. In fact 
the latter statement follows from the following analogue of Khintchine’s Theorem 
7 for approximation by algebraic numbers, also established in [13]. 


Theorem 47 (Beresnevich, 1999). Letn € N, YW: N > Rt be a monotonic error 
function and A,,(W) the set of real & such that 


|§ —a| < H(a)"Y(A(@)) (48) 
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has infinitely many solutions in real algebraic numbers of degree dega = n. Then 
A,(W) has full Lebesgue measure if the sum (44) diverges and zero Lebesgue 
measure otherwise. 


Bugeaud [32] has proved an analogue of Theorem 47 for approximation by algebraic 
integers: 


Theorem 49 (Bugeaud, 2002). Letn € N,n > 2, ¥: N > R® be a monotonic 
error function and T,(W) be the set of real & such that 


|E —a| < H(a)"*'W(A(a)) (50) 


has infinitely many solutions in real algebraic integers of degree dega = n. Then 
T,(W) has full Lebesgue measure if the sum (44) diverges and zero Lebesgue 
measure otherwise. 


No analogue of Theorem 45 is known for the monic polynomial case; however, 
see the final section of [23]. 

Unlike Theorem 45, the convergence parts of Theorems 47 and 49 are rather 
trivial consequences of the Borel—Cantelli lemma, which also implies that the 
monotonicity condition is unnecessary in the case of convergence. The intriguing 
question now arises whether the monotonicity condition in Theorem 45 and in 
the divergence part of Theorems 47 and 49 can be dropped. Beresnevich [16] has 
recently shown that the monotonicity condition on W can indeed be safely removed 
from Theorem 45. Regarding Theorems 47 and 49, removing the monotonicity 
condition is a fully open problem - [16]. In fact, in dimension n = 1, removing 
the monotonicity from Theorem 47 falls within the Duffin and Schaeffer problem 
[56]. Note, however, that the higher -dimensional Duffin—Schaeffer problem has 
been settled in the affirmative [108]. 

It is interesting to compare Theorems 47 and 49 with their global counterparts. 
In the case of approximation by algebraic numbers of degree < n, the appropriate 
statement is known as the Wirsing conjecture 37 (see §2.5). The latter has been 
verified for n = 2 by Davenport and Schmidt but is open in higher dimensions. 
Theorem 47 implies that the statement of the Wirsing conjecture 37 holds for almost 
all real € — the actual conjecture states that it is true at least for all transcendental 
&. In the case of approximation by algebraic integers of degree < n, Roy has 
shown that the statement analogous to Wirsing’s conjecture is false [114]. However, 
Theorem 49 implies that the statement holds for almost all real &, In line with the 
recent “metrical” progress on Littlewood’s conjecture by Einsiedler, Katok, and 
Lindenstrauss [57], it would be interesting to find out whether the set of possible 
exceptions to the Wirsing—Schmidt conjecture is of Hausdorff dimension zero. A 
similar question can also be asked about approximation by algebraic integers; this 
would shed light on the size of the set of exceptions, shown to be non-empty by Roy. 
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A. Baker [10] suggested a strengthening of Mahler’s problem in which the height 
A(P) = max{|a,|,..., |ao|} of the polynomial P(x) = a,x" +--++ a,x + do is 
replaced by 


H*(P) =| | max{1, |a;|}'/”. 


i=1 


The corresponding statement has been established by Kleinbock and Margulis [76] 
in a more general context of Diophantine approximation on manifolds. Specialising 
their result to polynomials gives the following. 


Theorem 51 (Kleinbock & Margulis, 1998). Let ¢ > 0. Then for almost all & € R 
the inequality 
|P()| < H*(Py** (52) 


has only finitely many solutions in P € Z|x] with deg P <n. 


A multiplicative analogue of Theorem 45 with H(P) replaced by H*(P) has 
been obtained by Bernik, Kleinbock and Margulis [27] (also within the framework 
of manifolds). Note that in their theorem the convergence of (44) must be replaced 
by the stronger condition that )°72, W(h)(logh)""! < oo. This condition is 
believed to be optimal but it is not known if the multiplicative analogue of 
Theorem 47, when H(P) is replaced by H*(P), holds. In [23], Beresnevich and 
Velani have proved an inhomogeneous version of the theorem of Kleinbock and 
Margulis and, in particular, an inhomogeneous version of Theorem 51. 

With [11], A. Baker and W.M. Schmidt pioneered the use of Hausdorff dimension 
in the context of approximation of real numbers by algebraic numbers with a natural 
generalisation of the Jarnik—Besicovitch theorem: 


Theorem 53 (Baker & Schmidt, 1970). Let w = n. Then the set of — € R for 
which 
|§ —a| < H(a)™ (54) 


holds for infinitely many algebraic numbers a with dega < n has Hausdorff 
dimension (n + 1)/(w + 1). 


In particular, Theorem 53 implies that the set 


A(w) = {é eR: |P(§)| < H(P)™ for infinitely many P € Z[x], deg P < n\ 
(55) 


has Hausdorff dimension at least (1 + 1)/(w + 1). Baker and Schmidt conjectured 
that this lower bound is sharp, and this was established by Bernik in [24]: 

n+1 

w+ 


This theorem has an important consequence for the spectrum of Diophantine 
exponents already discussed (see Chap. 5 of [36]). Bugeaud [32] has obtained an 


Theorem 56 (Bernik, 1983). Let w > n. Then dim A(w) = 
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analogue of Theorem 53 in the case of algebraic integers. However, obtaining an 
analogue of Theorem 56 for the case of algebraic integers is as yet an open problem. 

Recently, Beresnevich, Dickinson, and Velani have established a sharp Hausdorff 
measure version of Theorem 53, akin to a classical result of Jarnik. In order to avoid 
introducing various related technicalities, we refer the reader to [21, §12.2]. Their 
result implies the corresponding divergent statement for the Hausdorff measure of 
the set of € € R such that (46) holds infinitely often. Obtaining the corresponding 
convergent statement represents yet another open problem. 

There are various generalisations of the above results to the case of complex and 
p-adic numbers and more generally to the case of S-arithmetic (for instance by 
D. Kleinbock and G. Tomanov in [77]). 


3 Simultaneous Diophantine approximation in higher 
dimensions 


In §2.3, we considered polynomial approximation to a complex number &, which 
is the study of |P(&)| for P € Z[X]. As we have seen, negative results on the 
existence of polynomial approximations lead to transcendence measures. A more 
general situation is to fix several complex numbers xj,...,X and to study the 
smallness of polynomials in these number; negative results provide measures of 
algebraic independence to X1,...,Xm. 

This is again a special case, where & = x/'---x4, of the study of linear 
combinations in &, ..., &), where &, ..., &, are given complex numbers. Now 
negative results are measures of linear independence to &, ..., &n. 

There are still more general situations which we are not going to consider 
thoroughly but which are worth mentioning, namely the study of simultaneous 
approximation of dependent quantities and approximation on a manifold (see for 
instance [26]). 

We start with the question of algebraic independence (§3.1) in connection with 
extensions to higher dimensions of Gel’ fond’s Criterion 11. Next (§§3.2 and 3.3) we 
discuss a recent work by M. Laurent [91], who introduces further coefficients for the 
study of simultaneous approximation. The special case of two numbers (§3.4) is best 
understood so far. 

There is a very recent common generalisation of the question of Diophantine 
approximation to a point in R” which is considered in §3.3 on the one hand, and of 
the question of approximation to a real number by algebraic numbers of bounded 
degree considered in §2.5 on the other hand. It consists in the investigation of the 
approximation to a point in R” by algebraic hypersurfaces, or more generally alge- 
braic varieties defined over the rationals. This topic has been recently investigated 
by W. M. Schmidt in [127] and [126]. 
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3.1 Criteria for algebraic independence 


In §2.2 we quoted Gel’fond’s algebraic independence results of two numbers of 
the form a’ (1 < i < d —1). His method has been extended in the work of 
several mathematicians including A.O. Gel’fond, A.A. Smelev, W.D. Brownawell, 
G.V. Chudnovsky, P. Philippon, Yu.V. Nesterenko, G. Diaz (see [68], [79], [131], 
[101, 102], [104, 107], [59] Chap. 6 and [103]). So far the best known result, due to 
G. Diaz [50], proves “half” of what is expected. 


Theorem 57. Let B be an algebraic number of degree d > 2 and a a non-zero 
algebraic number. Moreover, let loga be any non-zero logarithm of a. Write a in 
place of exp(zlog a). Then among the numbers 


at least [(d + 1)/2] are algebraically independent. 


In order to prove such a result, as pointed out by S. Lang in [78], it would have 
been sufficient to replace the transcendence criterion theorem 11 by a criterion 
for algebraic independence. However, an example, going back to A. Ya. Khintchine 
in 1926 [74] and quoted in J.W.S. Cassels’s book ([46] Chap. V, Th. 14; see also 
the appendix of [104] and Appendix A of [121]), shows that in higher dimensions, 
some extra hypothesis cannot be avoided (and this is a source of difficulty in the 
proof of Theorem 57). After the work of W.D. Brownawell and G.V. Chudnovsky, 
such criteria were proved by P. Philippon [104, 107], Yu. V. Nesterenko [101, 102], 
M. Ably, C. Jadot (further references are given in [59] and [138] §15.5). Reference 
[103] is an introduction to algebraic independence theory which includes a chapter 
on multihomogeneous elimination by G. Rémond [109] and a discussion of criteria 
for algebraic independence by P. Philippon [107]. 

Further progress was made by M. Laurent and D. Roy in 1999, who produced 
criteria with multiplicities [92,93] (see also [94]) and considered questions of ap- 
proximation by algebraic sets. Moreover, in [93] they investigate the approximation 
properties, by algebraic numbers of bounded degree and height, of a m-tuple which 
generates a field of transcendence degree 1. This means that the corresponding point 
in C” belongs to an affine curve defined over Q. For m = | they proved in [92] the 
existence of approximation; this has been improved by G. Diaz in [51]. Further 
contributions are due to P. Philippon (see for instance [106]). 

A very special case of the investigation of Laurent and Roy is a result related to 
Wirsing’s lower bound for w;* (see §2.5), with a weak numerical constant, but with 
a lower bound for the degree of the approximation. Their result (Corollary 1 of §2 
of [93]) has been improved by Y. Bugeaud and O. Teulié (Corollary 5 of [45]) who 
prove that the approximations @ can be required to be algebraic numbers of exact 
degree n or algebraic integers of exact degree n + 1. 
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Theorem 58 (Bugeaud and Teulié). Let ¢ > 0 be a real number, n = 2 an integer 
and & a real number which is not algebraic of degree n. Then the inequality 


|é _ a| < Ha)" t9)/2) + 


has infinitely many solutions in algebraic integers a of degree n. 


The ¢€ in the exponent can be removed by introducing a constant factor. Further, 
for almost all aw € R, the result holds with the exponent replaced by n, as shown by 
Y. Bugeaud in [33]. 

There are close connections between questions of algebraic independence and si- 
multaneous approximation of numbers. We shall not discuss this subject thoroughly 
here; it would deserve another survey. We just quote a few recent papers. 

Applications of Diophantine approximation questions to transcendental number 
theory are considered by P. Philippon in [105]. M. Laurent [87] gives heuristic 
motivations in any transcendence degree. Conjecture 15.31 of [138] on simultaneous 
approximation of complex numbers suggests a path towards results on large 
transcendence degree. 

In [110] D. Roy shows some limitations of the conjectures of algebraic ap- 
proximation by constructing points in C” which do not have good algebraic 
approximations of bounded degree and height, when the bounds on the degree 
and height are taken from specific sequences. The coordinates of these points are 
Liouville numbers. 


3.2. Four exponents: asymptotic or uniform simultaneous 
approximation by linear forms or by rational numbers 


Let &, ..., &, be real numbers. Assume that the numbers 1, &), ..., &, are linearly 
independent over Q. There are (at least) two points of view for studying approxima- 
tion to €;, ..., &,. On the one hand, one may consider linear forms (see for instance 
[78]) 


|Xo + x1&; free Ht Xn€n|- 


On the other hand, one may investigate the existence of simultaneous approximation 
by rational numbers 


Each of these two points of view has two versions, an asymptotic one (with exponent 
denoted by w) and a uniform one (with exponent denoted by @). This gives rise to 
four exponents introduced in [42] (see also [91]), 


o(9), @(6), o(f8), (6), 
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where 

&1 
6=(&,...,&) and ‘6=]: 
En 

We shall recover the situation of §§2.1 and 2.4 in the special case where &; = &', 
1<=7 < ator? = 6.8), 


(0) = an(€), O00) =On(§), o( 0) =a,(€), OF) = 0, (€). 


Notice that the index n is implicit in the notation involving a, since it is the number 
of components of 0. 

We start with the question of asymptotic approximation by linear forms. We 
denote by w(@) the supremum of the real numbers w for which there exist infinitely 
many positive integers N for which the system 


[xo + x1) +--+ + 4n&,| < NY, O< max |x;| < N, (59) 
O<i<n 
has a solution in rational integers xo, X1,..., X,. An upper bound for w(@) is a linear 
independence measure for 1, &1,..., &n. 


The hat version of w(@) is, as expected, related to the study of uniform 
approximation by linear forms: we denote by @(@) the supremum of the real 
numbers w such that for any sufficiently large integer N, the same system (59) has 
a solution. 

Obviously @(@) < w(6). 

The second question is that of asymptotic simultaneous approximation by 
rational numbers. Following again [91], we denote by w(‘@) the supremum of the 
real numbers w for which there exist infinitely many positive integers N for which 
the system 


max |x; — x9& |< N~”, with O< max |x;|<N (60) 
1<i<n O0<i<n 
has a solution in rational integers xo, Xx,,...,X,. An upper bound for w(‘@) is a 
simultaneous approximation measure for 1, £,,..., &n. 


The uniform simultaneous approximation by rational numbers is measured by 
the hat version of w: we denote by @(‘@) the supremum of the real numbers w such 
that for any sufficiently large integer NV, the same system (60) has a solution. 

Again @(0) < w('0). 

Transference principles provide relations between w(@) and w(‘@). The next 
result (Khintchine, 1929 [74]) shows that w(@) = n if and only if w('0) = 1/n. 
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Theorem 61 (Khintchine transference principle). [f we set @ = w(6) and'w = 
w('9), then we have 


w 


wo>n'otn—-1 and ‘wo >———. 
(n-—l)w+n 


In order to study these numbers, M. Laurent introduces in [91] further exponents 
as follows. 


3.3. Further exponents, following M. Laurent 


For each d in the range 0 < d < n — 1, M. Laurent [91] introduces two exponents, 
one for asymptotic approximation wy (8) and one for uniform approximation @q (9), 
which measure the quality of simultaneous approximation to the given tuple 06 = 
(&1,...,&,) from various points of view. First embed R” into P”(R) by mapping 


6 = (&,...,&) to (& 2 +++: & 2 1). 


Now for 0 < d <n — 1, define 
wa(0) = sup \w: there exist infinitely many vectors 
X =x Av Axg € AMH (Zt) for which |X A 6] < xiv} 
and 
@a(@) = sup {w: for any sufficiently large NV, there exists 
X =x A---Axg € AMET) 


such that0 <|X|<N and |XA0|< nol 


Hence wy (0) > @a(O). 
The multivector ¥ = x9 A-::A Xq is a system of Pliicker coordinates of the 
linear projective subvariety L = (xo,...,Xa) C P”(R). Then 


|X A 6| 
|X||6| 
is essentially the distance d(@,L) = min,ez, d(0,x) between the image of @ in 


P”(R) to L. As a consequence, equivalent definitions are as follows, where H(L) 
denotes the Weil height of any system of Plticker coordinates of L. 


wa(0) = sup {w: there exist infinitely many L, rational over Q, 


dimL =d and d(0,L) < HL) 
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and 
@a(@) = sup {w: for any sufficiently large N, there exists L, rational over Q, 
dim L = d, H(L) < N and d(6,L) < H(L) IN} 
In the extremal cases d = 0 and d = n — 1, one recovers the exponents of § 3.2: 
(8) = (8), @o(P) = OA), @n-1(8) = (9), @n-1(8) = O(8). 


The lower bound 
= d+1 
nC eee forall d=0,..., n—-1 
n—d 
valid for all 6 (with 1,&, ..., &, linearly independent over Q) follows from the 
results of W.M. Schmidt in his foundational paper [123] (see [44]). In particular for 
d =n-—1andd = Orespectively, this lower bound yields 


@(0)=n and @('6)>1/n 
and in the special case &; = &' (1 <i <n) one recovers the lower bounds 
OnE) >n and 1 (€) > 1/n, 


which we deduced in § 2.3 and § 2.4 respectively from Dirichlet’s box principle. 

It was proved by Khintchine in 1926 [74] that w(@) =n if and only if 
w('6) = 1/n. In [91], M. Laurent slightly improves on earlier inequalities due 
to W.M. Schmidt [123], splitting the classical Khintchine’s transference principle 
(Theorem 61) into intermediate steps. 


Theorem 62 (Schmidt, Laurent). Fix n > 1 and 0 = (&,...,&) € R". Set 
@a = @a(8), 0 < d <n-—1. The “going up transference principle” is 


i O0<d<n-2, 
n—-d-—1 


while the “going down transference principle” is 


dod 


——, l<d<n-l. 
@gt+d+1 


@d-1 2 


Moreover, these estimates are optimal. 


692 M. Waldschmidt 


As a consequence of Theorem 62, one deduces that if a = (d + 1)/(n — d) 
for one value of d in the range 0 < d < n — 1, then the same equality holds for all 
d =0,1,...,n — 1. Hence, for almost all 0 € R”, 


x d+1 
wa(9) = @a(9) = = for O<d<n-1l. 
n—d 
A complement to Theorem 62, involving the hat coefficients, is given in [91] Th. 3. 
A problem raised in [91] is to find the spectrum in (R U {+00})" of the n-tuples 


(wo(4), .-+,@n—1(8)), 


where @ ranges over the elements (&,...,§)) in R" with 1,&,...,&, linearly 
independent over Q. Partial results are given in [91]. 

In [42] Y. Bugeaud and M. Laurent define and study exponents of inhomogeneous 
Diophantine approximation. Further progress on this topic has been achieved by 
M. Laurent in [89]. 


3.4 Dimension 2 


We consider the special case n = 2 of §3.3: we replace (&1, &) by (&, 7). So let & 
and 7 be two real numbers with 1, &, 7 linearly independent over Q. 
Khintchine’s transference Theorem 61 reads in this special case 


o(§, 1) <o(F) <p 
o§n)+2 ~~ \nd 2 


V. Jarntk studied these numbers in a series of papers from 1938 to 1959 (see [36, 42, 
90]). He proved that both sides are optimal. Also Jarnik’s formula (of which (29) is 


a special case) reads 
- ‘) 1 
o =l-= : (63) 
é o(§,n) 


The spectrum of each of our four exponents is as follows: 


w(&, n) takes any value in the range [2, +00], 

a) (*) takes any value in the range [1/2, 1], 
n 

@(€, 7) takes any value in the range [2, +oo], 


iO) ) takes any value in the range [1/2, 1]. 
ui] 
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Moreover, for almost all (€, 7) € R?, 


olE.n) = OE, n) = 2, o(*) =a(°) = 


A more precise description of the spectrum of the quadruple is due to M. Laurent 
[90]: 


Theorem 64 (Laurent). Assume 1, &, 17 are linearly independent over Q. The four 
exponents 


we wen: vo =o(F). Net a’ -a(*) 


are related by 


Ba ~, @-1 ow(@-1) , o-@+1 
2<@<+0, 0 = — <4 = —_— 
(a) o+ao (a) 


with the obvious interpretation if w = +00. Conversely, for any (w, w',®,@’) in 


(Rs U {+00})* satisfying the previous inequalities, there exists (,n) € R’, with 
1, €, n linearly independent over Q, for which 


@ = w(€,n), vo =o('). ® = of€,n), a'=a(*). 


As a consequence: 
Corollary 65. The exponents w = w(&,n), © = O(E,n) are related by 
w@>@(@-1) and O22. 
Conversely, for any (@,@) satisfying these conditions, there exists (&, 1) for which 


o(§.n)=@ and O(§.n) =o. 


Corollary 66. The exponents w' = w (*). o' =O (*) are related by 
1 1 


oO” 


o’ 2. —— and 
1—o’ 


Conversely, for any (', @') satisfying these conditions, there exists (E,) with 
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The next open problem has been raised by M. Laurent: 


Open Problem 67 (Laurent). /s there an extension of Jarnik’s equality (63) in 
higher dimensions relating (8) and @('6) for 8 € R"? 


3.5 Approximation by hypersurfaces 


In dimension 1 an irreducible hypersurface is nothing else than a point. The 
exponents w,(&) and their hat companions in § 2.3 measure | P(&)| for P € Z[X], 
while w*(&) of §2.5 measure the distance between a point € € C and algebraic 
numbers q@. 

A generalisation of these questions in higher dimensions, where & € C", is the 
study of | P(€)| for P € Z[X,..., X,] and of ming |E—a|, where a runs over the set 
of zeros of such P. As already mentioned in the introduction of §3, a lower bound 
for | P(&)| when the degree of P is fixed is nothing else than a linear independence 
measure for (€,...,&”). To consider such quantities also when the degree of P 
varies yields a generalisation of Mahler’s classification to several variables, which 
has been considered by Yu Kunrui [140]. A generalisation to higher dimensions 
of both Mahler and Koksma classifications has been achieved by W.M. Schmidt in 
[126], who raises a number of open problems suggesting that the close connection 
between the two classifications in dimension | does not extend to the classification 
of tuples. 

In [127] W.M. Schmidt deals with approximation to points € in R” or in C” by 
algebraic hypersurfaces, and more generally by algebraic varieties, defined over the 
rationals. 

Let M be a nonempty finite set of monomials in x),...,X, with |M| elements. 
Denote by P(M) the set of polynomials in Z[x),...,X,] which are linear com- 
binations of monomials in M. Using Dirichlet’s box principle or Minkowski’s 
theorem on linear forms, one shows the existence of nonzero elements in P(M) 
for which | P(&)| is small. It is a much more difficult task to get the existence of 
nonzero elements in P(M) for which the distance 5(E : A(P)) between & and the 
hypersurface A(P) defined by P = 0 is small. 

W.M. Schmidt asks whether given € and M, there exists c = c(&, M) > 0 such 
that there are infinitely many P € P(M) with 5(€, A(P)) < cH(P)~”, where 
m = |M| in the real case € € R" and m = |M|/2 in the complex case € € C”. 
He proves such an estimate when |M| = n + 1, and also in the real case when 
|M| = n + 2. In the case |M| = n + 1 he proves a uniform result, in the sense 
of Y. Bugeaud and M. Laurent [41]: given N = 1, there isa P € P(M) with 
height H(P) < N for which 5(&, A(P)) < cN'"" H(P)~!. A number of further 
results are proved in which the exponent is not the conjectured one. The author also 
investigates the approximation by algebraic hypersurfaces (another reference on this 
topic is [94]). 
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Special cases of the very general and deep results of this paper were due to 
F. Amoroso, W.D. Brownawell, M. Laurent and D. Roy, and P. Philippon. Further 
previous results related with Wirsing’s conjecture were also achieved by V.I. Bernik 
and K.I. Tishchenko. 

An upper bound for the distance 6(€, A) means that there is a point on the 
hypersurface A (or more generally the variety A) close to &. The author also 
investigates the “size” of the set of such elements. The auxiliary results proved in 
[127] on this question have independent interest. 


3.6 Further metrical results 


The answer to the question of Schmidt on approximation of points & € R” by 
algebraic hypersurfaces A(P) is almost surely affirmative. This follows from a 
general theorem established by Beresnevich, Bernik, Kleinbock and Margulis. With 
reference to Section 3.5, let MM be a set of monomials of cardinality m = |M| in 
variables x;,...,X;, where we naturally assume that m > 2. Further, let P(M) be 
the set of polynomials in Z[x,,..., x;] which are linear combinations of monomials 
in M. Given a function W : N — (0, +00), let 


Ac(%M) = {(Eis.-+5 80) € 10, IF : |P EL... 6) < HP PwH(P)) 
for infinitely many P € Pim} ‘ 


We are interested in |A;,(W%M)|, the k-dimensional Lebesgue measure of 


Ac(Y M). 


Theorem 68 (Beresnevich, Bernik, Kleinbock and Margulis). For any decreas- 
ing W, 


0 if po WA) < co, 


Lif DR Wh) = 00. 


The convergence case of this theorem has been independently established by Beres- 
nevich [15] and Bernik, Kleinbock and Margulis [27] using different techniques. 
The multiplicative analogue of the convergence part of Theorem 68, where H(P) is 
replaced with H *(P), has also been obtained in [27]. In addition, Theorem 68 holds 
when M is a set of m analytic functions defined on (0, 1) and linearly independent 
over R. The analyticity assumption can also be relaxed towards a non-degeneracy 
condition. 

The divergence case is established in [19] in the following stronger form 
connected with §3.5, where the notation 5 and A(P) are explained. 


|Ak(Y% M)| = 
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Theorem 69 (Beresnevich, Bernik, Kleinbock and Margulis). Let W be decreas- 
ing and such that )\;-., (h) diverges. Then for almost all § = (&,...,&&) € 
(0, iF, 

5(§, A(P)) < H(P)-""'W(H(P)) 


has infinitely many solutions P € P(M). 


Taking W(h) = h7!log 'h, we get the following corollary which answers 
Schmidt’s question in §3.5 in the affirmative for almost all points: 


Corollary 1. For almost all & = (&,...,&;) € R*, the inequality 
6(E, A(P)) < H(P)~" log! H(P) (70) 


has infinitely many solutions P € P(M). 


Another interesting corollary corresponds to the special case of M being the 
set of all monomials of degree at most d. In this case we simply have the case of 
approximation by multivariable polynomials of degree at most d, where now 


m= (55) 


In the case of convergence in Theorem 68, a lower bound for the Hausdorff 
dimension of A; (YW, M) is implied by a general theorem for manifolds of Dickinson 
and Dodson [53]. Obtaining the corresponding upper bound in general remains an 
open problem but see Theorem 56 and [12, 17,54]. The Hausdorff measure version 
of Theorem 69 has been established in [21]. 

Yet another class of interesting problems concerns the measure of transcendence 
and algebraic independence of numbers. Recall that complex numbers Z),... , Z are 
called algebraically independent if for any non-zero polynomial P € Z[x1,...,Xml, 
the value P(z,,...,Zn) is not 0. Actually, P(z,...,2m) can still get very small 
when z),...,Zm are algebraically independent. Indeed, using Dirichlet’s pigeonhole 
principle, one can readily show that there is a constant c; > 0 such that for any real 
numbers x;,..., Xm, there are infinitely many polynomials P € Z[x,,...,X] such 
that 


Jt m+1 
loeee5 Am : , 
|P(x mie (71) 


where ¢(P) = deg P + log H(P) is called the type of P. A conjecture of Mahler 
[97] proved by Nesterenko [100] says that in the case m = 1 for almost all real 
numbers x, there is a constant co > 0 such that |P(x,)| > eo(P)” for all non- 
zero P € Z[x]. Nesterenko has also shown that for t = m + 2, for almost all 
(X1,...,Xm) € R”™ there is a constant cp > 0 such that 


—cot(P)* 


|P(x1,---,Xm)| >e for all non-zero P € Z[x,,...,Xml (72) 
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and conjectured that the latter is indeed true with the exponent t = m + 1. This has 
been verified by Amoroso [6] over C, but the “real” conjecture has been recently 
established by Nesterenko’s student Mikhailov [99]: 


Theorem 73 (Mikhailov, 2007). Lett = m+1. Then for almost all (x,,...,Xm) € 
R” there is a constant cy > 0 such that (72) holds. 


We conclude by discussing the interaction of metrical, analytic, and other 
techniques in the question of counting and distribution of rational points near a 
given smooth planar curve J". In what follows we will assume that the curvature 
of I” is bounded between two positive constants. Let Nr(Q, 6) denote the number 
of rational points (p1/q, p2/q), where pi, p2,g € ZwithO < q < OQ, withina 
distance at most 6 from I. 

Huxley [71] has proved that for any ¢ > 0, Nr(O,5) « QO7*°S + Q. Until 
recently, this bound has remained the only non-trivial result. Furthermore, very little 
has been known about the existence of rational points near planar curves for 6 < 
Q~?/*, that is whether Nr(Q,6) > 0 when 6 < Q~*/?. An explicit question of 
this type motivated by Elkies [58] has been recently raised by Barry Mazur who 
asks: “given a smooth curve in the plane, how near to it can a point with rational 
coordinates get and still miss?” (Question (3) in [98, § 11]). When = 0(Q~”), the 
rational points in question cannot miss I” if I” is a rational quadratic curve in the 
plane (see [26]). This leads to Nr(Q, 5) vanishing for some choices of ” when 6 = 
o(Q~*). For example, the curve I” given by x? + y* = 3 has no rational points [22]. 
When 5 >> Q~?, a lower bound on Nr-(Q, 8) can be obtained using Khintchine’s 
transfer principle. However, such a bound would be far from being close to the 
heuristic count of 075 (see [26]). The first sharp lower bound on N7(Q,64) has 
been given by Beresnevich [14], who has shown that for the parabola = (x, x’), 
Nr(Q,6) > O75 when 5 > Q~. 

Recently, Beresnevich, Dickinson, and Velani [22] have shown that for an 
arbitrary smooth planar curve I” with non-zero curvature Nr(Q,5) >> 035 when 
5 >> Q~. Moreover, they show that the rational points in question are uniformly 
distributed in the sense that they form a ubiquitous system (see [18] for a discussion 
on ubiquity and related notions). They further apply this to get various metric results 
about simultaneous approximation to points on J”. These include a Khintchine-type 
theorem and its Hausdorff measure analogue. In particular, for any w € (1/2, 1) 
they explicitly obtain the Hausdorff dimension of the set of w-approximable points 
on I": 


” 


dim| (x,y) € I : max{llqxll lay <q 


2~ 
for infinitely many g € N \ a (74) 
l+w 


Here I” is a smooth planar curve with non-vanishing curvature. Using analytic 
methods, Vaughan and Velani [137] have shown that e > 0 can be removed from 
Huxley’s estimate for Nr(Q, €). Combining the results of [22] and [137] gives the 
following natural generalisation of Khintchine’s theorem. 


698 M. Waldschmidt 


Theorem 75 (Beresnevich, Dickinson, Vaughan, Velani). Let ~ : N — (0, +00) 
be monotonic. Let be a C®) planar curve of finite length € with non-vanishing 
curvature and let 


Ax(y. 0) = {(x,y) € Ps max{ligxll lay} < ¥@ 
holds for infinitely many q € nt : 
Then the arclength? |A2(w, ’)| of Ar(W, I’) satisfies 


0 if DLA Wh) < oo, 
£ if Dh Yh) = 00. 


Ary, I) = 


Furthermore, let s € (0,1) and let H* denote the s-dimensional Hausdorff measure. 
Then 


0. yr Vay <a, 
H'(Ar(y, P)) = (76) 
+00 if Py Wh)’ = oo. 


Note that (74) is a consequence of (76). 

In higher dimensions, Drutu [55] has studied the distribution of rational points 
on non-degenerate rational quadrics in R” and obtained a result similar to (76) in 
the case w(q) = o(q~*). However, simultaneous Diophantine approximation on 
manifolds as well as the distribution of rational points near manifolds (in particular 
algebraic varieties) is little understood.* In other words, the higher-dimensional 
version of the “near-misses” question of Mazur mentioned above has never been 
systematically considered. 


References 


1. B. ADAMCZEWSKI - “Sur l’exposant de densité des nombres algébriques”, Int. Math. Res. 
Not., article ID rnm024, 6 pages, 2007. 

2. B. ADAMCZEWSKIand Y. BUGEAUD — “Palindromic continued fractions.”, Ann. Inst. Fourier 
57 (2007), no. 5, p. 1557-1574. 

3. — , “Mesures de transcendance et aspects quantitatifs de la méthode de Thue—Siegel—Roth— 
Schmidt”, Proc. London Math. Soc. 101 (2010), 1-31. 

4. G. ALAIN — “Simultaneous approximation of a real number by all conjugates of an algebraic 
number.”, Acta Arith. 127 (2007), no. 1, p. 63-70. 


3One-dimensional Lebesgue measure on I”. 


4see Beresnevich: Rational points near manifolds and metric Diophantine approximation. Ann. of 
Math. (to appear). http://arxiv.org/abs/0904.0474. 


Rec 


18. 


19. 


20. 


21. 


22. 


23% 


24. 


25. 


26. 


Dike 


28. 


ent advances in Diophantine approximation 699 


. K. ALNIAGIK, Y. AVCI and Y. BUGEAUD — “On U,,-numbers with small transcendence 
measure”, Acta Math. Hungar. 99 (2003), no. 4, p. 271-277. 

.F AMoROSO — “Polynomials with high multiplicity”, Acta Arith. 56 (1990), no. 4, 
p. 345-364. 

. M. AMOU and Y. BUGEAUD — “On integer polynomials with multiple roots”, Mathematika 
54 (2007), no. 1-2, p. 83-92. 

. B. ARBOUR and D. Roy — “A Gel’fond type criterion in degree two”, Acta Arith. 111 (2004), 
no. I, p. 97-103. 

. A. BAKER — “On a theorem of Sprindzuk”, Proc. Royal Soc. Series A 292 (1966), p. 92-104. 

. — , Transcendental number theory, Cambridge University Press, London, 1975. 

. A. BAKER and W. M. SCHMIDT — “Diophantine approximation and Hausdorff dimension”, 
Proc. Lond. Math. Soc. 21 (1970), p. 1-11. 

. R. C. BAKER — “Dirichlet’s theorem on Diophantine approximation”, Math. Proc. Cam. Phil. 
Soc. 83 (1978), p. 37-59. 

. V. BERESNEVICH — “On approximation of real numbers by real algebraic numbers”, Acta 
Arith. 90 (1999), no. 2, p. 97-112. 

. — , “Distribution of rational points near a parabola”, Dokl. Nats. Akad. Nauk Belarusi 45 

(2001), no. 4, p. 21-23, 123. 

—., “A Groshev type theorem for convergence on manifolds”, Acta Math. Hungar. 94 (2002), 

no. 1-2, p. 99-130. 

. —, “On a theorem of V. Bernik in the metric theory of Diophantine approximation”, Acta 
Arith. 117 (2005), no. 1, p. 71-80. 

. V. BERESNEVICH, V. I. BERNIK and M. M. DODSON — “On the Hausdorff dimension of sets 

of well-approximable points on nondegenerate curves”, Dokl. Nats. Akad. Nauk Belarusi 46 

(2002), no. 6, p. 18-20, 124. 

— , “Regular systems, ubiquity and Diophantine approximation”, A panorama of number 

theory or the view from Baker’s garden (Ziirich, 1999), Cambridge Univ. Press, Cambridge, 

2002, p. 260-279. 

V. BERESNEVICH, V. I. BERNIK, D. Y. KLEINBOCK and G. A. MARGULIS — “Metric 

Diophantine approximation: the Khintchine-Groshev theorem for nondegenerate manifolds”, 

Mosc. Math. J. 2 (2002), no. 2, p. 203-225, Dedicated to Yuri I. Manin on the occasion of his 

65th birthday. 

V. BERESNEVICH, H. DICKINSON and S. VELANI - “Sets of exact ‘logarithmic’ order in the 

theory of Diophantine approximation”, Math. Ann. 321 (2001), no. 2, p. 253-273. 

— , “Measure theoretic laws for lim sup sets”, Mem. Amer. Math. Soc. 179 (2006), no. 846, 

p. x+91. 

— , “Diophantine approximation on planar curves and the distribution of rational points’, 

Ann. of Math. (2) 166 (2007), no. 3, p. 367-426. 

V. BERESNEVICH and S. VELANI-— “An inhomogeneous transference principle and Diophan- 

tine approximation’, Proc. Lond. Math. Soc. (3) 101 (2010), no. 3, 821-851. 

V. I. BERNIK — “An application of Hausdorff dimension in the theory of Diophantine 

approximation”, Acta Arith. 42 (1983), no. 3, p. 219-253, (In Russian). English transl. in 

Amer. Math. Soc. Transl. 140 (1988), 15-44. 

— , “On the exact order of approximation of zero by values of integral polynomials”, Acta 

Arith. 53 (1989), p. 17-28, (In Russian). 

V. I. BERNIK and M. M. DODSON — Metric Diophantine approximation on manifolds, 

Cambridge Tracts in Mathematics, vol. 137, Cambridge University Press, Cambridge, 1999. 

V. I. BERNIK, D. KLEINBOCK and G. A. MARGULIS — “Khintchine-type theorems on 

manifolds: the convergence case for standard and multiplicative versions”, Internat. Math. 

Res. Notices (2001), no. 9, p. 453-486. 

V. I. BERNIK and K. I. TISHCHENKO — “Integral polynomials with an overfall of the 

coefficient values and Wirsing’s theorem”, Dokl. Akad. Nauk Belarusi 37 (1993), no. 5, 

p. 9-11, 121 (1994). 


700 


50. 


53. 


54. 


M. Waldschmidt 


. Y. BILU — “The many faces of the subspace theorem [after Adamczewski, Bugeaud, Corvaja, 
Zannier. . . |”, Séminaire Bourbaki, 59e année (2006-2007), No 967, 2006. 

. E. BOMBIERI and W. GUBLER — Heights in Diophantine geometry, New Mathematical 
Monographs 4. Cambridge: Cambridge University Press, 2006. 

. Y. BUGEAUD — “Approximation par des nombres algébriques”, J. Number Theory 84 (2000), 

no. 1, p. 15-33. 

— , “Approximation by algebraic integers and Hausdorff dimension”, J. Lond. Math. Soc. 65 

(2002), p. 547-559. 

. —, “Approximation by algebraic integers and Hausdorff dimension.”, J. Lond. Math. Soc., 
I. Ser. 65 (2002), no. 3, p. 547-559. 

. — , “Mahler’s classification of numbers compared with Koksma’s”, Acta Arith. 110 (2003), 
no. I, p. 89-105. 

. — , “Sets of exact approximation order by rational numbers”, Math. Ann. 327 (2003), no. 1, 
p. 171-190. 

.— , Approximation by algebraic numbers, Cambridge Tracts in Mathematics, vol. 160, 
Cambridge University Press, Cambridge, 2004. 

. —, “Mahler’s classification of numbers compared with Koksma’s. IIT’, Publ. Math. Debrecen 
65 (2004), no. 3-4, p. 305-316. 

. —, “Diophantine approximation and Cantor sets.”, Math. Ann. 341 (2008), no. 3, p. 677-684. 

.— , “Mahler’s classification of numbers compared with Koksma’s. II”, Diophantine ap- 
proximation. Festschrift for Wolfgang Schmidt, vol. 16, 2008, Developments in Math, Eds: 
H. P. Schlickewei, K. Schmidt and R. Tichy, Springer-Verlag, p. 107-121. 

. Y. BUGEAUD and J.-H. EVERTSE — “Approximation of complex algebraic numbers by 
algebraic numbers of bounded degree”, Ann. Scuola Normale Superiore di Pisa, 8 (2009), 
333-368. 

. Y. BUGEAUD and M. LAURENT — “Exponents of Diophantine approximation and Sturmian 
continued fractions”, Ann. Inst. Fourier (Grenoble) 55 (2005), no. 3, p. 773-804. 

.— , “On exponents of homogeneous and inhomogeneous Diophantine approximation”, 
Moscow Math. J. 5 (2005), p. 747-766. 

. —, “Exponents of Diophantine approximation”, Diophantine Geometry proceedings, Scuola 
Normale Superiore Pisa, Ser. CRM 4 (2007), p. 101-121. 

.— , “On transfer inequalities in Diophantine approximation. II”, Math. Zeitschrift, 265 
(2010), 249-262. 

. Y. BUGEAUD and O. TEULIE — “Approximation d’un nombre réel par des nombres 
algébriques de degré donné”, Acta Arith. 93 (2000), no. 1, p. 77-86. 


. J. W. S. CASSELS — An introduction to Diophantine approximation, Cambridge Tracts in 


Mathematics and Mathematical Physics, No. 45, Cambridge University Press, New York, 
1957. 
. T. W. CUSICK and M. E. FLAHIVE — The Markoff and Lagrange spectra, Mathematical 
Surveys and Monographs, vol. 30, American Mathematical Society, Providence, RI, 1989. 
. H. DAVENPORT and W. M. SCHMIDT — “Approximation to real numbers by quadratic 
irrationals”, Acta Arith. 13 (1967), p. 169-176. 


.— , “Approximation to real numbers by algebraic integers”, Acta Arith. 15 (1969), 
p. 393-416. 
G. DIAZ — “Grands degrés de transcendance pour des familles d’exponentielles”, J. Number 


Theory 31 (1989), no. 1, p. 1-23. 


. —, “Une nouvelle propriété d’ approximation diophantienne”, C. R. Acad. Sci. Paris Sér. I 


Math. 324 (1997), no. 9, p. 969-972. 
. G. DIAZ and M. MIGNOTTE — “Passage d’une mesure d’approximation 4 une mesure de 
transcendance”, C. R. Math. Rep. Acad. Sci. Canada 13 (1991), no. 4, p. 131-134. 
H. DICKINSON and M. M. DODSON — “Extremal manifolds and Hausdorff dimension”, Duke 
Math. J. 101 (2000), no. 2, p. 271-281. 
M. M. DoDSON, B. P. RYNNE and J. A. G. VICKERS — “Metric Diophantine approximation 
and Hausdorff dimension on manifolds”, Math. Proc. Cam. Phil. Soc. 105 (1989), p. 547-558. 


Rec 


55 


56 


57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


65. 


60. 


67. 


68. 


69. 


70. 


71. 


72. 


73. 


74. 


75. 
76. 


Eds 


78. 


79. 


80 


ent advances in Diophantine approximation 701 


. C. DRUTU — “Diophantine approximation on rational quadrics”, Math. Ann. 333 (2005), no. 2, 
p. 405-469. 

.R. J. DUFFIN and A. C. SCHAEFFER — “Khintchine’s problem in metric Diophantine 
approximation”, Duke Math. J. 8 (1941), p. 243-255. 

M. EINSIEDLER, A. KATOK and E. LINDENSTRAUSS — “Invariant measures and the set of 

exceptions to Littlewood’s conjecture”, Ann. of Math. (2) 164 (2006), no. 2, p. 513-560. 

N. ELKIES — “Rational points near curves and small nonzero |x* — y?| via lattice reduction”, 

Algorithmic number theory (Leiden, 2000), Lecture Notes in Comput. Sci., vol. 1838, 

Springer, Berlin, 2000, http://arxiv.org/abs/math/0005139, p. 33-63. 

N. I. FEL’DMAN and Y. V. NESTERENKO — “Transcendental numbers”, in Number Theory, 

IV, Encyclopaedia Math. Sci., vol. 44, Springer, Berlin, 1998, p. 1-345. 

N. L. FEL/DMAN and A. B. SIDLOVSKIi — “The development and present state of the theory 

of transcendental numbers”, Uspehi Mat. Nauk 22 (1967), no. 3 (135), p. 3-81. 

S. FISCHLER — “Spectres pour l’approximation d’un nombre réel et de son carré”, C. R. Math. 

Acad. Sci. Paris 339 (2004), no. 10, p. 679-682. 

— , “Palindromic prefixes and Diophantine approximation”, Monatshefte Math. 151 (2007), 

no. I, p. 1-87. 

S. FISCHLER and T. RIVOAL — “Un exposant de densité en approximation rationnelle”, 

International Mathematics Research Notices (2006), no. 24, p. 48, Article ID 95418. 

A. O. GEL’FOND — “On the algebraic independence of algebraic powers of algebraic 

numbers”, Doklady Akad. Nauk SSSR (N.S.) 64 (1949), p. 277-280. 

—, “On the algebraic independence of transcendental numbers of certain classes”, Doklady 

Akad. Nauk SSSR (N.S.) 67 (1949), p. 13-14. 

— , “On the algebraic independence of transcendental numbers of certain classes”, Uspehi 

Matem. Nauk (N.S.) 4 (1949), no. 5(33), p. 14-48. 

— , “The approximation of algebraic numbers by algebraic numbers and the theory of 

transcendental numbers”, Amer. Math. Soc. Translation (1952), no. 65, p. 45. 

— , Transcendental and algebraic numbers, Translated by Leo F. Boron from the first Russian 

edition “Transcendentnye i algebraigeskie Cisla”, Gosudarstv. Izdat. Tehn.-Teor. Lit., Moscow 
1952, Dover Publications Inc., New York, 1960. 

G. HARMAN - Metric number theory., vol. 18, London Mathematical Society Monographs. 

New Series, Oxford: Clarendon Press, 1998. 

P.-C. HU and C.-C. YANG — Distribution theory of algebraic numbers, de Gruyter Expositions 

in Mathematics, vol. 45, Walter de Gruyter GmbH & Co. KG, Berlin, 2008. 

M. N. HUXLEY — “The rational points close to a curve”, Ann. Scuola Norm. Sup. Pisa Cl. Sci. 

(4) 21 (1994), no. 3, p. 357-375. 

V. JARNIK — “Zum Khintchineschen “Ubertragungssatz”, Acad. Sci. URSS., Fil. Géorgienne, 

Trav. Inst. math., Thilissi, 3 (1938), p. 193-216 (German). 

A. Y. KHINTCHINE — “Einige Siatze tiber Kettenbriiche, mit Anwendungen auf die Theorie 

der Diophantischen Approximationen”, Math. Ann. 92 (1924), p. 115-125. 

— , “Uber eine Klasse linearer Diophantischer Approximationen”, Rendiconti Palermo, 50 

(1926), p. 170-195. 

— , Continued fractions, Dover Publications Inc., Mineola, NY, 1964. 

D. Y. KLEINBOCK and G. A. MARGULIS — “Flows on homogeneous spaces and Diophantine 

approximation on manifolds”, Ann. of Math. (2) 148 (1998), no. 1, p. 339-360. 

D. KLEINBOCK and G. TOMANOV -— “Flows on S-arithmetic homogeneous spaces and 

applications to metric Diophantine approximation.”, Comment. Math. Helv. 82 (2007), no. 3, 

p. 519-581. 

S. LANG — “Report on Diophantine approximations”, Bull. Soc. Math. France 93 (1965), 

p. 177-192, = [84] p. 326-341. 

— , Introduction to transcendental numbers, Addison-Wesley Publishing Co., Reading, 

Mass.-London-Don Mills, Ont., 1966, = [84] p. 396-506. 

. —, “Transcendental numbers and Diophantine approximations”, Bull. Amer. Math. Soc. 77 
(1971), p. 635-677, = [85] p. 1-43. 


702 


81. 


82. 


100. 


101. 


102. 


103. 


104. 


105. 


106. 


107. 


M. Waldschmidt 


— , “Higher dimensional Diophantine problems”, Bull. Amer. Math. Soc. 80 (1974), 
p. 779-787, = [85] p. 102-110. 

— , “Old and new conjectured Diophantine inequalities”, Bull. Amer. Math. Soc. (N.S.) 23 
(1990), no. 1, p. 37-75, = [86] p. 355-393. 


. — , Introduction to Diophantine approximations, second éd., Springer-Verlag, New York, 


1995. 


. — , Collected papers. Vol. I, Springer-Verlag, New York, 2000, 1952-1970. 

. —, Collected papers. Vol. II, Springer-Verlag, New York, 2000, 1971-1970. 

. — , Collected papers. Vol. III, Springer-Verlag, New York, 2000, 1978-1990. 

. M. LAURENT — “New methods in algebraic independence”, Gyéry, Kalman (ed.) et al., 


Number theory. Diophantine, computational and algebraic aspects. Proceedings of the 
international conference, Eger, Hungary, July 29-August 2, 1996, Berlin: de Gruyter. 311- 
330 , 1998. 


. —, “Simultaneous rational approximation to the successive powers of a real number”, /ndag. 


Math. (N.S.) 14 (2003), no. 1, p. 45-53. 


. —, “On inhomogeneous Diophantine approximation and Hausdorff dimension’, preprint, 


Proceedings of a conference dedicated to Gelfond, Moscow, 9 pages, 2007. 


. —, “Exponents of Diophantine Approximation in dimension two”, Michel Waldschmidt 6 


octobre 2011 16:32. 


. —, “On transfer inequalities in Diophantine approximation”, in “Analytic Number Theory - 


Essays in Honour of Klaus Roth’, Cambridge University Press, 2009, p. 306-314. 


. M. LAURENT and D. Roy — “Criteria of algebraic independence with multiplicities and 


interpolation determinants”, Trans. Amer. Math. Soc. 351 (1999), no. 5, p. 1845-1870. 


.— , “Sur l’approximation algébrique en degré de transcendance un”, Ann. Inst. Fourier 


(Grenoble) 49 (1999), no. 1, p. 27-55. 


. —, “Criteria of algebraic independence with multiplicities and approximation by hypersur- 


faces”, J. reine angew. Math. 536 (2001), p. 65-114. 


. J. LEVESLEY, C. SALP and S. VELANI — “On a problem of K. Mahler: Diophantine 


approximation and Cantor sets”, Math. Ann. 338 (2007), no. 1, p. 97-118. 


.K. MAHLER — “Uber das Maf der Menge aller S-Zahlen”, Math. Ann. 106 (1932), 


p. 131-139. 


. —, “On the order function of a transcendental number’, Acta Arith. 18 (1971), p. 63-76. 
. B. MAZUR — “Perturbations, deformations, and variations (and “near-misses”) in geometry, 


physics, and number theory”, Bull. Amer. Math. Soc. (N.S.) 41 (2004), no. 3, p. 307-336 
(electronic). 


.S. V. MIKHAILOV — “A transcendence type for almost all points in an m-dimensional real 


space”, Mat. Sb. 198 (2007), no. 10, p. 67-88. 

Y. V. NESTERENKO — “An order function for almost all numbers”, Mat. Zametki 15 (1974), 
p. 405-414. 

— , “Algebraic independence of algebraic powers of algebraic numbers”, Mat. Sb. (N.S.) 
123(165) (1984), no. 4, p. 435-459. 

—, “The measure of algebraic independence of values of certain functions”, Mat. Sb. (N.S.) 
128(170) (1985), no. 4, p. 545-568, 576. 

Y. NESTERENKO and P. PHILIPPON (éds.) — Jntroduction to algebraic independence theory, 
Lecture Notes in Mathematics, vol. 1752, Springer-Verlag, Berlin, 2001. 

P. PHILIPPON — “Critéres pour l’indépendance algébrique”, Inst. Hautes Etudes Sci. Publ. 
Math. 64 (1986), p. 5-52. 

—, “Une approche méthodique pour la transcendance et l’indépendance algébrique de valeurs 
de fonctions analytiques”, J. Number Theory 64 (1997), no. 2, p. 291-338. 

—., “Approximations algébriques des points dans les espaces projectifs. I’, J. Number Theory 
81 (2000), no. 2, p. 234-253. 

— , “Criteria for algebraic independence”, Introduction to algebraic independence theory, 
Lecture Notes in Math., Vol. 1752, Springer, Berlin, 2001, Chap. 8 in [103], p. 133-141. 


Rec 


108 


109. 


110. 


111. 


112. 


114. 


ent advances in Diophantine approximation 703 


. A. POLLINGTON AND R. VAUGHAN - “The k-dimensional Duffin and Schaeffer conjecture”, 

Mathematika 37 (1990), p. 190-200. 

G. REMOND — “Elimination multihomogéne”, in Introduction to algebraic independence 

theory, Lecture Notes in Math., Vol. 1752, Springer, Berlin, 2001, Chap. 5 in [103], p. 53-81. 

D. Roy — “Approximation algébrique simultanée de nombres de Liouville”, Canad. Math. 

Bull. 44 (2001), no. 1, p. 115-120. 

—, “An arithmetic criterion for the values of the exponential function”, Acta Arith. 97 (2001), 

no. 2, p. 183-194. 

—, “Une formule d’interpolation en deux variables”, J. Théor. Nombres Bordeaux 13 (2001), 

no. 1, p. 315-323, 21€ Journées Arithmétiques (Rome, 2001). 

. —, “Approximation simultanée d’un nombre et de son carré”, C. R. Math. Acad. Sci. Paris 

336 (2003), no. 1, p. 1-6. 

— , “Approximation to real numbers by cubic algebraic integers. II”, Ann. of Math. (2) 158 

(2003), no. 3, p. 1081-1087. 

. —, “Approximation to real numbers by cubic algebraic integers. I”, Proc. London Math. Soc. 
(3) 88 (2004), no. 1, p. 42-62. 

. —, “Diophantine approximation in small degree”, Number theory, CRM Proc. Lecture Notes, 
vol. 36, Amer. Math. Soc., Providence, RI, 2004, p. 269-285. 

. —, “Simultaneous approximation by conjugate algebraic numbers in fields of transcendence 
degree one”, Int. J. Number Theory 1 (2005), no. 3, p. 357-382. 

. —, “On two exponents of approximation related to a real number and its square”, Canadian 
J. Math. 59 (2007), no. 1, p. 211-224, 

. —, “On the continued fraction expansion of a class of numbers”, Diophantine approximation. 
Festschrift for Wolfgang Schmidt, vol. 16, 2008, in: Developments in Math, Eds: H. P. Schlick- 
ewei, K. Schmidt and R. Tichy, Springer-Verlag, p. 347-361. 

. —, “Small value estimates for the mutiplicative group”, Acta Arith. 135.4 (2008), p. 357-393. 

. — , “Small value estimates for the additive group”, Int. J. Number Theory, 6 (2010), no. 4, 
919-956. 

.D. Roy and M. WALDSCHMIDT — “Diophantine approximation by conjugate algebraic 
integers”, Compos. Math. 140 (2004), no. 3, p. 593-612. 

. W. M. SCHMIDT — “On heights of algebraic subspaces and Diophantine approximations”, 
Ann. of Math. (2) 85 (1967), p. 430-472. 

. — , Diophantine approximation, Vol.785, Lecture Notes in Mathematics. Berlin-Heidelberg- 
New York: Springer-Verlag, 1980. 

. — , Diophantine approximations and Diophantine equations, Lecture Notes in Mathematics, 
vol. 1467, Springer-Verlag, Berlin, 1991. 

. —, “Mahler and Koksma classification of points in R” and C””, Functiones et Approximatio 
35 (2006), p. 307-319. 

. — , “Diophantine approximation by algebraic hypersurfaces and varieties”, Trans. Amer. 
Math. Soc. 359 (2007), no. 5, p. 2221-2241. 

. T. SCHNEIDER — Einfiihrung in die transzendenten Zahlen, Springer-Verlag, Berlin, 1957, 
French Transl., Introduction aux nombres transcendants, Gauthier-Villars, Paris (1959). 

. V.G. SPRINDZUK — Mahler’s problem in metric number theory, Translated from the Russian 
by B. Volkmann. Translations of Mathematical Monographs, Vol. 25, American Mathematical 
Society, Providence, R.I., 1969. 

. —, Metric theory of Diophantine approximations, V. H. Winston & Sons, Washington, D.C., 
1979. 

.R. TINDEMAN — “On the algebraic independence of certain numbers”, Nederl. Akad. 
Wetensch. Proc. Ser. A 74=Indag. Math. 33 (1971), p. 146-162. 

. K. I. TISHCHENKO — “On new methods in the problem of the approximation of real numbers 
by algebraic numbers of degree at most three”, Vests = 1 Nats. Akad. Navuk Belarus = 1 Ser. 
F=1z.-Mat. Navuk 4 (2000), p. 26-31, 141. 


3. — , “On some special cases of the Wirsing conjecture”, Vests i Nats. Akad. Navuk Belarus = 


1 Ser. F = 1z.-Mat. Navuk 3 (2000), p. 47-52, 140. 


704 


134. 


135. 


136. 


137. 


138. 


139. 


140. 


M. Waldschmidt 


—., “On the application of linearly independent polynomials in the Wirsing problem’, Dokl. 
Nats. Akad. Nauk Belarusi 44 (2000), no. 5, p. 34-36, 124. 

—, “On the approximation of real numbers by algebraic integers of the third degree”, Dokl. 
Nats. Akad. Nauk Belarusi 45 (2001), no. 1, p. 17-19, 136. 

— , “On approximation of real numbers by algebraic numbers of bounded degree”, J. Number 
Theory 123 (2007), no. 2, p. 290-314. 

R. C. VAUGHAN and S. VELANI — “Diophantine approximation on planar curves: the 
convergence theory”, Invent. Math. 166 (2006), no. 1, p. 103-124. 

M. WALDSCHMIDT — Diophantine approximation on linear algebraic groups Transcendence 
properties of the exponential function in several variables, Grundlehren der Mathematischen 
Wissenschaften, vol. 326, Springer-Verlag, Berlin, 2000. 

E. WIRSING — “Approximation mit algebraischen Zahlen beschraénkten Grades”, J. reine 
angew. Math. 206 (1960), p. 67-77. 

K. R. Yu — “A generalization of Mahler’s classification to several variables”, J. Reine angew. 
Math. 377 (1987), p. 113-126. 


